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Protein Glycosylation Modification in Pichia Pastoris 

Field of the Invention 

The present invention relates to methods and vectors useful for genetically 
5 modifying the glycosylation process in methylotrophic yeast strains for the purpose of 
producing glycoproteins with reduced glycosylation. The present invention further 
relates to methylotrophic yeast strains generated using the present methods and 
vectors, as well as glycoproteins produced from such genetically modified strains. 

10 Background of the Invention 

The methylotrophic yeasts including Pichia pastoris have been widely used 
for production of recombinant proteins of commercial or medical importance. 
However, production and medical applications of some therapeutic glycoproteins can 
be hampered by the differences in the protein-linked carbohydrate biosynthesis 

15 between these yeasts and the target organism such as a mammalian subject. 

Protein N-glycosylation originates in the endoplasmic reticulum (ER), 
where an N-linked oligosaccharide (Glc 3 Man 9 GlcNAc 2 ) assembled on dolichol (a lipid 
carrier intermediate) is transferred to the appropriate Asn of a nascent protein. This is 
an event common to all eukaryotic N-linked glycoproteins. The three glucose residues 

20 and one specific a- 1 ,2-1 inked mannose residue are removed by specific glucosidases 
and an a-l,2-mannosidase in the ER, resulting in the core oligosaccharide structure, 
Man 8 GlcNAc 2 . The protein with this core sugar structure is transported to the Golgi 
apparatus where the sugar moiety undergoes various modifications. There are 
significant differences in the modifications of the sugar chain in the Golgi apparatus 

25 between yeast and higher eukaryotes. 

In mammalian cells, the modification of the sugar chain proceeds via 3 
different pathways depending on the protein moiety to which it is added. That is, (1) 
the core sugar chain does not change; (2) the core sugar chain is changed by adding 
the N-acetylglucosamine-l-phosphate moiety (GlcNAc-l-P) in UDP-N-acetyl 
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glucosamine (UDP-GlcNAc) to the 6-position of mannose in the core sugar chain, 

followed by removing the GlcNAc moiety to form an acidic sugar chain in the 
glycoprotein; or (3) the core sugar chain is first converted into Man 5 GIcNAc 2 by 
removing 3 mannose residues with mannosidase I; MansGlcNAc 2 is further modified 
5 by adding GlcNAc and removing 2 more mannose residues, followed by sequentially 
adding GlcNAc, galactose (Gal), and N-acetylneuraminic acid (also called sialic acid 
(NeuNAc)) to form various hybrid or complex sugar chains (R. Kornfeld and S. 
Kornfeld s ^m;. Rev. Biochem. 54: 631-664, 1985; Chiba et al J. Biol. Chem. 273: 
26298-26304, 1998). 

1 0 ln y east > the - modification of the sugar chain in the Golgi involves a series 

of additions of mannose residues by different mannosyltransferases ("outer chain" 
glycosylation). The structure of the outer chain glycosylation is specific to the 
organisms, typically with more than 50 mannose residues in S. cerevisiae, and most 
commonly with structures smaller than Man 14 GlcNAc 2 in Pichia pastoris. This yeast- 

1 5 specific outer chain glycosylation of the high mannose type is also denoted 
hyperglycosylation. 

Hyperglycosylation is often undesired since it leads to heterogeneity of a 
recombinant protein product in both carbohydrate composition and molecular weight, 
which may complicate the protein purification. The specific activity (unitsAveight) of 

20 hyperglycosylated enzymes may be lowered by the increased portion of carbohydrate. 
In addition, the outer chain glycosylation is strongly immunogenic which is 
undesirable in a therapeutic application. Moreover, the large outer chain sugar can 
mask the immunogenic determinants of a therapeutic protein. For example, the 
influenza neuraminidase (NA) expressed in P. pastoris is glycosylated with N-glycans 

25 containing up to 30-40 mannose residues. The hyperglycosylated NA has a reduced 
immunogenicity in mice, as the variable and immunodominant surface loops on top of 
the NA molecule are masked by the N-glycans (Martinet et al. Eur J. Biochem. 247: 
332-338, 1997). 
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Therefore, it is desirable to genetically engineer methylotrophic yeast 
strains in which glycosylation of proteins can be manipulated and from which 
recombinant proteins can be produced that would not be compromised in structure or 
function by large N-gly can side chains. 

5 

Summary of the Invention 

The present invention is directed to methods and vectors useful for 
genetically modifying the glycosylation process in methylotrophic yeast strains to 
produce glycoproteins with reduced glycosylation. Methylotrophic yeast strains 

10 generated using the present methods and vectors, as well as glycoproteins produced 
from such genetically modified strains are also provided. 

In one embodiment, the present invention provides vectors useful for 
making genetically engineered methylotrophic yeast strains which are capable of 
producing glycoproteins with reduced glycosylation. 

15 In one aspect, the present invention provides "knock-in" vectors which are 

capable of expressing in a methylotrophic yeast strain one or more proteins whose 
enzymatic activities lead to a reduction of glycosylation in glycoproteins produced by 
the methylotrophic yeast strain. 

In a preferred embodiment, the knock-in vectors of the present invention 

20 include a nucleotide sequence coding for an a- 1,2-mannosidase or a functional part 

thereof and are capable of expressing the a- 1,2-mannosidase or the functional part in a 
methylotrophic yeast strain. A preferred nucleotide sequence is a nucleotide sequence 
encoding the a-l,2-mannosidase of a fungal species, and more preferably, 
Trichodermareesei. Preferably, the a- 1,2-mannosidase expression vector is 

25 engineered such that the a- 1,2-mannosidase or a functional part thereof expressed 
from the vector includes an ER-retention signal. A preferred ER-retention signal is 
HDEL. The a- 1,2-mannosidase coding sequence can be operable linked to a 
constitutive or inducible promoter, and a 3' termination sequence. The vectors can be 

>. ■ 
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integrative vectors or replicative vectors. Particularly preferred a-l,2-mannosidase 
expression vectors include p GAPZMFManHDEL, pGAPZMFManMycHDEL, 
pPICZBMFManMycHDEL, pGAPZmManHDEL, pGAPZmMycManHDEL, 
pPIC9mMycManHDEL and pGAPZmMycManHDEL. 

In another preferred embodiment, the knock-in vectors of the present 
invention include a sequence coding for a glucosidase II or a functional part thereof 
and are capable of expressing the glucosidase II or the functional part in a 
methylotrophic yeast strain. A preferred nucleotide sequence is a nucleotide sequenc 
encoding the glucosidase II of a fungal species, and more preferably, Saccharomyces 
cerevisiae. Preferably/the glucosidase II expression vector is engineered such that the 
glucosidase E or a functional part thereof expressed from the vector includes an ER- 
retention signal. A preferred ER-retention signal is HDEL. The glucosidase n coding 
sequence can be operable linked to a constitutive or inducible promoter, and a 3' 
termination sequence. The vectors can be integrative vectors or replicative vectors. 
Particularly preferred glucosidase II expression Vectors include pGAPZAGLSII, 
pPICZAGLSII, pAOX2ZAGLSII, pYPTIZAGLSII, pGAPADEglsII, pPICADEglsII, 
P AOX2ADEglsII, pYPTIADEglsII, pGAPZAglsIIHDEL and pGAPADEglsIIHDEL. 

Expression vectors which include both of an cc-l,2-mannosidase expression 
unit and a glucosidase II expression unit are also provided by the present invention. 

In another aspect, the present invention provides "knock-out" vectors 
which, when introduced into a methylotrophic yeast strain, inactivate or disrupt a gene 
thereby facilitating the reduction in the glycosylation of glycoproteins produced in the 
methylotrophic yeast strain. 

In one embodiment, the present invention provides a "knock-out" vector 
which, when introduced into a methylotrophic yeast strain, inactivates or disrupts the 
Ochl gene. A preferred Ochl knock-out vector is pBLURA5'PpOCHl . 

Still another embodiment of the present invention provides vectors which 
include both a knock-in unit and a knock-out unit. 
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Furthermore, any of the knock-in or knock-out vectors of the present 
invention can also include a nucleotide sequence capable of expressing a heterologous 
protein of interest in a methylotrophic yeast 

Another embodiment of the present invention provides methods of 
5 modifying the glycosylation in a methylotrophic yeast by transforming the yeast with 
one or more vectors of the present invention. 

Strains of a methylotrophic yeast which can be modified using the present 
methods include, but are not limited to, yeast strains capable of growth on methanol 
such as yeasts of the genera Candida, Hansemtla, Torulopsis, and Pichia, Preferred 
10 methylotrophic yeasts are of the genus Pichia. Especially preferred are Pichia 

pastoris strains GS115 (NRRL Y-15851), GS190 (NRRL Y-18014), PPF1 (NRRL Y- 
1 801 7), PPY120H, yGC4, and strains derived therefrom. Methylotrophic yeast strains 
which can be modified using the present methods also include those methylotrophic 
yeast strains which have been engineered to express one or more heterologous proteins 
15 of interest. The glycosylation on the heterologous proteins expressed from these - 

previously genetically engineered strains can be reduced by transforming such strains 
with one or more of the vectors of the present invention 

Methylotrophic yeast strains which are modified by practicing the present 
methods are provided in another embodiment of the present invention. 
20 A further aspect of the present invention is directed to methods of 

producing glycoproteins with a reduced glycosylation. 

In accordance with such methods, a nucleotide sequence capable of 
expressing a glycoprotein can be introduced into a methylotrophic yeast strain which 
has previously been transformed with one or more of the vectors of the present 
25 invention. Alternatively, a methylotrophic yeast strain which has been genetically 
engineered to express a glycoprotein can be transformed with one or more of the 
vectors of the present invention. Moreover, if a methylotrophic yeast strain is not 
transformed with a nucleotide sequence encoding a glycoprotein of interest or any of 
the vectors of the present invention, such yeast strain can be transformed, either 

5 



WO 02/00856 



PCT/IB01/01514 



10 



15 



20 



25 



consecutively or simultaneously, with both a nucleotide sequence capable of 
expressing the glycoprotein and one or more vectors of the present invention. 
Additionally, a methylotrophic yeast strain can be transformed with one or more of the 
present knock-in and/or knock-out vectors which also include a nucleotide sequence 
capable of expressing a glycoprotein in the methylotrophic yeast strain. 

Glycoproteins products produced by using the methods of the present 
invention, i.e., glycoproteins with reduced N-glycosylation, are also part of the present 
invention. 

Kits which include one or more of the vectors of the present invention, or 
one or more strains modified to produce glycoproteins with reduced glycosylatibn, are 
also provided. 

Brief Description of the Drawings 

Figure 1 depicts vectors carrying an HDEL-tagged Trichoderma reesei cc- 
1 ,2-mannosidase expression cassette and describes the way in which these vectors 
were constructed according to methods known in the art. Abbreviations used 
throughout construction schemes: 5' AOX1 or AOX1 P: Pichia pastoris AOX1 
promoter sequence; Amp R: ampicillin resistance gene; ColEl: ColEl origin of. 
replication; 3'AOXl: 3' sequences of the Pichia pastoris AOX1 gene; HIS4: HIS4 
gene of Pichia pastoris. AOX TT: transcription terminator sequence of the Pichia 
pastoris AOX1 gene; ORF: open reading frame; S: secretion signal; P TEF1: the 
promoter sequence of the Saccharomyces cerevisiae transcription elongation factor 1 
gene; P EM7: synthetic constitutive prokaryotic promoter EM7; Zeocin: Zeocin 
resistance gene; CYC 1 TT: 3' end of the S, cerevisiae CYCl gene; GAP: promoter 
sequence of the Pichia pastoris glyceraldehyde-3-phosphate dehydrogenase gene; 
PpURA3: Pichia pastoris URA3 gene. As can be seen in this figure, the Trichoderma 
reesei a- 1 ,2-mannosidase was operably linked to the coding sequence for the S. 
cerevisiae a-mating factor secretion signal sequence and further operably linked at the 
3' terminus of the coding sequence to the coding sequence for an HDEL peptide. The 
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whole fusion construct was operably linked to either the P. pastoris AOX1 promoter 
(in pPIC9MFManHDEL) or to the P. pastoris GAP promotor (in 
pGAPZMFManHDEL). 

Figure 2 depicts vectors carrying an HDEL-tagged Mm musculus a-1,2- 
mannosidase IB expression cassette and describes the way in which these vectors were 
constructed according to methods known in the art. As can be seen in this figure, the 
catalytic domain of the Mus musculus a-l,2-mannosidase IB was operably linked to 
the coding sequence for the S. cerevisiae a-mating factor secretion signal sequence 
and further operably linked at the 3' terminus of the coding sequence to the coding 
sequence for an HDEL peptide. The whole fusion construct was operably linked to 
either the P. pastoris AOX1 promoter (in pPIC9mManHDEL) or to the P. pastoris 
GAP promotor (in pG APZmManHDEL) . Furthermore, variants of the expression 
cassette were made in which the coding sequence for a cMyc epitope tag was inserted 
between the coding sequence for the S. cerevisiae a-Mating Factor secretion signal 
sequence and the coding sequence for the catalytic domain of the Mus musculus cc-1,2- 
mannosidase IB. This expression cassette was also operably linked to either the P. 
pastoris AOX1 promoter (in pPIC9mMycManHDEL) or to the P. pastoris GAP 
promotor (in pGAPZmMycManHDEL). 

Figure 3 depicts vectors carrying a MycHDEL tagged Trichoderma reesei 
a-l,2-mannosidase and the way in which these vectors were obtained. The resulting 
fusion construction was again operably linked to either the P. pastoris AOX1 
promoter (in pPICZBMFManMycHDEL) or to the P pastoris GAP promotor (in 
pGAPZMFManMycHDEL). 

Figure 4 demonstrates the intracellular localization of the MycHDEL- 
tagged Trichoderma reesei a-l 5 2-mannosidase and indicates ER-targeting by 
immunofluorescence analysis. Panel A Western blotting. Yeast strains were grown in 
10 ml YPG cultures to an OD 6 oo= 10, diluted fivefold and grown in YPM for 48 h. 
I/50th of the culture medium and l/65th of the cells were analysed by SDS-PAGE and 
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Western blotting with the mouse monoclonal 9E10 anti-Myc antibody. The position 
of molecular weight marker proteins are indicated with arrows. Lanes 1-5: cellular 
lysates. 1,2: pGAPZMFManMycHDEL transformants. 3: non-transformed PPY120H 
(negative control). 4,5: pPICZBMFManMycHDEL transformants. Lanes 6-10: 
5 culture media. 6: non transfonned PPY120H (negative control). 7,8: 

pGAPZMFManMycHDEL transfonnants. 9, 1 0: pPICZBMFManMycHDEL 
transformants. Panel B Immunofluorescence microscopy. 1 : phase contrast image of 
a P. pastoris cell (strain PPY120H transformed with pGAPZMFManHDEL) at lOOOx 
magnification. The nucleus is visible as an ellipse in the lower right quadrant of the 
10 cell. 2: same cell as in 1, but in fluorescence microscopy mode to show localization of 
the T. reesei mannosidase-Myc-HDEL protein. The protein is mainly localized in a 
circular distribution around the nucleus (nuclear envelope), which is typical for an 
endoplasmic reticulum steady-state distribution. 3: phase contrast image of a P. 
pastoris cell (strain PPY120H transformed with pGAPZMFManHDEL) at 1 OOOx 
1 5 magnification. 4: same cell in fluorescence microscopy to show localization of the 
Golgi marker protein OCH1-HA in P. pastoris strain PPY120H. The dot-like 
distribution throughout the cytoplasm, with 3-4 dots per cell is typical for cis-Golgi 
distribution in P. pastoris. 

Figure 5 depicts the co-sedimentation of mannosidase-MycHDEL with 
20 Protein Disulfide Isomerase in sucrose density gradient centrifiigation. The top panel 
shows the distribution over the different fractions of the sucrose gradient of the 
OCH1-HA Golgi marker protein. The middle panel shows this distribution for the 
Protein Disulfide Isomerase endoplasmic reticulum marker protein. Finally, the 
bottom panel shows the distribution of the MycHDEL-tagged Trichodenna reesei cc- 
25 1 ,2-mannosidase over the same fractions. It is concluded that the mannosidase- 
MycHDEL almost exactly matches the distribution of the ER marker PDI and thus 
mainly resides in the ER of the Pichia pastoris yeast cells. 

Figure 6 depicts the N-glycan analysis of Tiypanosoma cnizi trans- 
sialidase coexpressed with Trichoderma reesei mannosidase-HDEL. Panel A: malto- 
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oligosaccharide size reference ladder. Sizes of the glycans are expressed in Glucose 
Units (GU) by comparison of their electrophoretic mobility to the mobility of these 
malto-oligosaccharides. Panel B: N-glycans derived from recombinant Tiypanosoma 
cruzi Yraw-sialidase expressed m Pichia pastoris. The peak at GU=9,2 corresponds to 
Man 8 GlcNAc 2 . Panel C: same analytes as panel 2, but after overnight treatment with 
3U/ml purified recombinant T. reesei a-l,2-mannosidase. Panel D: N-glycans 
derived from recombinant /ra«.y-sialidase co-expressed in Pichiapastoris with T. 
reesei mannosidase-HDEL (under control of the GAP promotor). The peak at 
GU=7,6 corresponds to the Man 5 GlcNAc 2 peak in the profile of RNase B (Panel F). 
Panel E: same analytes as panel D, but after overnight treatment with 3 mU/ml 
purified recombinant T. reesei a-l,2-mannosidase. Panel F: N-glycans derived from 
bovine RNase B. These glycans consist of Man 5 GlcNAc2 to Man 8 GlcNAc 2 . Different 
isomers are resolved, accounting for the number of peaks for Man 7 GIcNAc 2 . 

Figure 7 depicts the processing of influenza haemagglutinin N-glycans by 
HDEL-tagged Trichoderma reesei a-l,2-mannosidase and the HDEL-tagged catalytic 
domain of murine a-1 ,2-mannosidase IB. The Man 5 GlcNAc 2 reference 
oligosaccharide runs at scan 1850 in this analysis (not shown). Panel 1 : malto- 
oligosaccharide size reference ladder. Panel 2: N-glycans derived from recombinant 
influenza haemagglutinin expressed in Pichia pastoris. The peak at scan 2250 
corresponds to Man 9 GlcNAc 2 . Panel 3: N-glycans derived from recombinant 
haemagglutinin co-expressed in Pichia pastoris with T. reesei mannosidase-HDEL 
(under control of the GAP promotor). The peak at scan 1 950 corresponds to 
Man 6 GlcNAc 2 . Panel 4: Same analytes as for panel 3, but after overnight treatment 
with 30 mU purified recombinant T. reesei cc-1 3 2-mannosidase. Panel 5: N-glycaris 
derived from recombinant haemagglutinin co-expressed in Pichia pastoris with mouse 
mannosidase IB-HDEL (under control of the GAP promotor). Panel 6: same analytes 
as for panel 5, but after overnight treatment with 30 mU purified recombinant T. reesei 
a- 1 s 2-mannosidase. 
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Figure 8 graphically depicts vector pBLURA5'PpOCH 1 and the way in 
which it was constructed. 

Figure 9 depicts the scheme for disrupting the Pichia pastoris OCH1 gene 
by single homologous recombination using pBLURA5'PpOCHl. 
5 Figure 10 depicts the cell wall glycoprotein N-glycan analysis of the 

Ochl -inactivated clone and three clones derived from this Ochl -inactivated clone by 
transformation with pGAPZMFManHDEL. Panel 1 shows the analysis of a mixture 
of malto-oligosaccharides, the degree of polymerisation of which is given by the 
numbers on the very top of the figure. This analysis serves as a size reference for the 
1 0 other panels. On the vertical axis of all panels, peak intensity in relative fluorescence 
units is given. Panel 2-6: analysis of the cell wall glycoprotein N-glycans of the 
following strains: Panel 2, non-engineered P. pastoris strain yGC4; Panel 3, yGC4 
transformed with pBLURA5*PpOchl; 4-6, three clones of the strain of Panel 3, 
supplementarily transformed with pGAPZMFManHDEL. Panel 7: the N-glycans 
15 derived from bovine RNaseB, consisting of a mixture of Man 5 -9GlcNAc 2 . As can be 
seen from comparison between panel 2 and 3 and reference to panel 7, transformation 
with pBLURA5'PpOchl leads to a strongly increased abundance of the Man 8 GlcNAc 2 
substrate N-glycan (named peak 1 in Panel 2) of OCHlp. Peak 2 represents the 
Man 9 GlcNAc 2 product of OCH lp. Furthermore, upon supplementary transformation 
20 of pGAPZMFManHDEL, the major glycan on the cell wall glycoproteins of three 
independent clones is the Man 5 GlcNAc 2 end product (peak 3 in panel 4) of T. reesei 
oc-l,2-mannosidase digestion of the Man g GlcNAc 2 substrate. 

. Figure 11 depicts the analysis of exactly the same glycan mixtures as in 
Figure 10, but after an in vitro digest with 3mU/ml purified Trichoderma reesei a- 1,2- 
25 mahnosidase, overnight in 20 itiM sodium acetate pH=5.0. Axis assignment is the 
same as in Figure 10. More Man 3 GlcNAc 2 is formed in the pBLURAS'PpOchl 
transformed strain (Panel 3) than in the parent strain (Panel 2). Peaks in all panels 
before scan 3900 come from contaminants and should be ignored in the analysis. 



10 



WO 02/00856 PCT/IB01/01514 



Figure 12 depicts the expression vector pGAPZAGLSII (SEQ ID NO: 1 8). 
P TEF1 : promotor of S. cerevisiae transcription elongation factor gene. P Em7: 
synthetic prokaryotic promotor. Zeocin: zeocine resistance marker gene CYC1.TT: 
transcription terminator of S. cerevisiae cytochrome CI gene. Col El : bacterial origin 
5 of replication. GAP: promotor of the P. pasttoris GAP gene. GLS2: S. cerevisiae 

glucosidase II gene. AOX1 TT: transcription terminator of the P. pastoris AOX1 gene 

Figure 13 depicts the expression vector pAOX2ZAGLSII (SEQ ID NO: 
16). P TEF1 : promotor of S. cerevisiae transcription elongation factor gene. P Em7: 

10 synthetic prokaryotic promotor. Zeocin: zeocine resistance marker gene. CYC1 TT: 
transcription terminator of S. cerevisiae cytochrome C 1 gene. Col El : bacterial origin 
of replication. AOX2 P: promotor of the P. pastoris AOX2 gene. GLS2: S. cerevisiae 
glucosidase II gene. AOX1 TT: transcription terminator of the P. pastoris AOX1 gene 
Figure 14 depicts the expression vector pPICZAGLSII (SEQ ID NO: 20). 

15 P TEF1 : promotor of S. cerevisiae transcription elongation factor gene. P Em7: 

synthetic prokaryotic promotor. Zeocin: zeocine resistance marker gene. CYC1 TT: 
transcription terminator of S. cerevisiae cytochrome CI gene. Col El : origin of 
replication. AOX1 P: promotor of the P. pastoris AOX1 gene. GLS2: S. cerevisiae 
glucosidase II gene. AOX1 TT: transcription terminator of the P. pastoris AOX1 gene 

20 Figure 15 depicts the expression vector pYPTlZAGLSII ((SEQ ID NO: 

22). P TEF1 : promotor of S. cerevisiae transcription elongation factor gene. P Em7: 
synthetic prokaryotic promotor. Zeocin: zeocine resistance marker gene. CYC1 TT: 
transcription terminator of S. cerevisiae cytochrome CI gene. Col El : origin of 
replication. P YPTl: promotor of the P. pastoris YPT1 gene. GLS2: S. cerevisiae 

25 glucosidase II gene. AOX1 TT: transcription terminator of the P. pastoris AOX1 
gene. 

Figure 16 depicts the expression vector pGAPADElglsII (SEQ ID NO: 
19). Amp R: Ampiilicin resistance marker gene. ADE1: P. pastoris ADE1 selection 

ii 
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marker gene. GAP: promotor of the P. Pastoris GAP gene. GLS2: S. cerevisiae 
glucosidase II gene. AOX1 TT: transcription terminator of the P. pastoris AOX1 gene 

Figure 17 depicts the expression vector pAOX2ADElglsII (SEQ ED NO: 
17). Amp R: Ampillicin resistance marker gene. ADE1 : P. pastoris ADE1 selection 
marker gene. AOX2 P: promotor of the P. pastoris AOX2 gene. GLS2: S. cerevisiae 
glucosidase II gene. AOX1 TT: transcription terminator of the P. pastoris AOX1 - 
gene. 

Figure 18 depicts the expression vector pPICADElglsII (SEQ ID NO: 21). 
Amp R: Ampillicin resistance marker gene. ADE1: P. pastoris ADE1 selection 
marker gene. AOX1 P: promotor of the P. pastoris AOX1 gene. GLS2: S. cerevisiae 
glucosidase II gene. AOX1 TT: transcription terminator of the P. pastoris AOX1 
gene. 

Figure 19 depicts the expression vector pYPTl ADElglsII (SEQ ID NO: 

23) . Amp R: Ampillicin resistance marker gene. ADE1: P. pastoris ADE1 selection 
15 marker gene. P YPT1 : promotor of the P. pastoris YPT1 gene. GLS2: S. cerevisiae 

glucosidase II gene. AOX1 TT: transcription terminator of the P. pastoris AOX1 
gene. 

Figure 20 depicts the expression vector pGAPZAglsIIHDEL (SEQ ID NO: 

24) . Amp R: Ampillicin resistance marker gene. ADE1: P. pastoris ADE1 selection 
20 marker gene. GAP: promotor of the P. pastoris GAP gene. GLS2: S. cerevisiae 

glucosidase II gene. AOX1 TT: transcription terminator of the P. pastoris AOX1 
gene. 

Figure 21 depicts the expression vector pGAPADElglsIIHDEL (SEQ ID 
NO: 25). P TEFi : promotorof S. cerevisiae transcription elongation factor gene. P 
25 Em7: synthetic prokaryotic promotor. Zeocin: zeocine resistance marker gene. CYC1 
TT: transcription terminator of S. cerevisiae cytochrome Gl gene. Col El: bacterial 
origin of replication. GAP: promotor of the P. pastoris GAP gene. GLS2: S. 
cerevisiae glucosidase II gene. AOX1 TT: transcription terminator of the P. pastoris 
AOX1 gene. 

12 
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Figure 22 depicts the test of the GLSII activity assay using a commercially 
available yeast alpha-glucosidase (Sigma: Cat. No. G-5003). The assay mixture 
contains phosphate-citrate buffer pH 6.8, mannose, 2-deoxy-D-glucose, the substrate 
4-methylumbeIlyferyl-alpha-D-glucopyranoside and alpha-glucosidase from Sigma. 1: 
5 assay mixture illuminated with UV-light after overnight incubation at 37 ° C; 2: same 
as 1, but this time, the assay mixture lacks the alpha-glucosidase; 3: same as 1, but this 
time, the assay mixture lacks the substrate. 

Figure 23 depicts the results of the activity of recombinantly expressed 
GLSII from Pichiapastoris. All assay mixtures were incubated overnight at 37 °C 
1 0 and afterwards illuminated with UV-light. 1 : assay with yeast alpha-glucosidase 

(Sigma: Cat. No. G-5003); 2: assay with the purified medium of strain 18 (PPY12-OH 
transformed with pGAPZAGLSII); 3: assay with purified medium of the WT PPY12- 
OH strain; 4: assay with the purified medium of strain H3 (PPY12-OH transformed 
with pGAPZAglsIIHDEL). 
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Detailed Description of the Invention 

It has been established that the majority of N-glycans on glycoproteins 
leaving the endoplasmic reticulum (ER) of Pichia have the core Man 8 GlcNAc 2 
oligosaccharide structure. After the proteins are transported from the ER to the Golgi 
20 apparatus, additional mannose residues are added to this core sugar moiety by 

different mannosyltransferases, resulting in glycoproteins with large mannose side 
chains. Such hyperglycosylation of recombinant glycoproteins is undesirable in many 
instances. Accordingly, the present invention provides methods and vectors for 

genetically modifying methylotrophic yeast strains to produce glycoproteins with 
25 reduced glycosylation. Methylotrophic yeast strains generated using the present 
methods and vectors, as well as glycoproteins produced from such genetically 
modified strains are also provided. 
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. In one embodiment, the present invention provides vectors useful for 
genetically modifying methylotrophic yeast strains to produce glycoproteins with 
reduced glycosylation. 

In one aspect, the present invention provides "knock-in" vectors which are 
5 capable of expressing in a methylotrophic yeast strain one or more proteins whose 
enzymatic activities lead to a reduction of glycosylation in glycoproteins produced by 
the methylotrophic yeast strain. According to the present invention, such proteins 
include, e.g., an ot-l,2-mannosidase, a glucosidase II, or functional parts thereof. 

In a preferred embodiment, the vectors of the present invention include a 
10 sequence coding for an ct-l,2-mannqsidase or a functional part thereof and are capable 
of expressing the a- 1 ,2-mannosidase or the functional part in a methylotrophic yeast 
strain. 

An cc-l,2-mannosidase cleaves the a-l,2-linked mannose residues at the 
non-reducing ends of Man 8 GIcNAc 2 , and converts this core oligosaccharide on .* 

15 ' glycoproteins to Man 5 GlcNAc 2 . In vitro, Man 5 GlcNAc 2 is a very poor substrate for 
any Pichia Golgi mannosyltransferase, i.e., mannose residues can not be added to this 
sugar structure. On the other hand, Man 5 GlcNAc 2 is the acceptor substrate for the 
mammalian N-acetylglucosaminyl-transferase I and is an intermediate for the hybrid- 
and complex-type sugar chains characteristic of mammalian glycoproteins. Thus, by 

20 way of introducing an a-l,2-mannosidase into methylotrophic yeasts such as Pichia, 
glycoproteins with reduced mannose content can be produced. 

According to the present invention, the nucleotide sequence encoding an a- 
1 ,2-mannosidase for use in the expression vector of the present invention can derive 
from any species. A number of a-l,2-mannosidase genes have been cloned and are 

25 available to those skilled in the art, including mammalian genes encoding, e.g., a 

murine a- 1 ,2-mannosidase (Herscovics etal.J. Biol Chern. 269: 9864-9871, 1994), a 
rabbit a-l,2-mannosidase (Lai et al. J, Biol Chem. 269: 9872-9881, 1994) or a human 
<x-l ,2-mannosidase (Tremblay et al. Glycobiology 8: 585-595, 1998), as well as fungal 
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genes encoding, e.g., an Aspergillus a-l,2-mannosidase (msdS gene), a Trichoderma 
reesei oc-l,2-mannosidase (Maras et al. J, Biotechnol 77: 255-263, 2000), or a 
Saccharomyces cerevisiae a-l,2-mannosidase. Protein sequence analysis has revealed 
a high degree of conservation among the eukaryotic a-l,2-mannosidases identified so 
5 far, 

Preferably, the nucleotide sequence for use in the present vectors encodes a 
fungal a-l,2-mannosidase, more preferably, a Trichoderma reesei a-l,2-mannosidase, 
and more particularly, the Trichoderma reesei ct-l,2-mannosidase described by Maras 
et al. J. Biotechnol 77: 255-63 (2000). 

1 0 According to the present invention, the nucleotide sequence can also code 

for only a functional part of an a-1 ,2-mannosidase. 

By "functional part" is meant a polypeptide fragment of an a-1,2- 
mannosidase which substantially retains the enzymatic activity of the full-length 
protein. By ''substantially" is meant at least about 40%, or preferably, at least 50% or 

15 more of the enzymatic activity of the full-length a-1 ,2-mannosidase is retained. For 
example, as illustrated by the present invention, the catalytic domain of the murine oc- 
1 ,2-mannosidase IB constitutes a "functional part" of the murine a-l,2-mannosidase 
IB. Those skilled in the art can readily identify and make functional parts of an a- 1,2- 
mannosidase using a combination of techniques known in the art. Predictions of the 

20 portions of an a-l,2-mannosidase essential to or sufficient to confer the enzymatic 
activity can be made based on analysis of the protein sequence. The activity of a 
portion of an a-l,2-mannosidase of interest, expressed and purified from an 
appropriate expression system, can be verified using in vitro or in vivo assays 
described hereinbelow. 

25 In accordance with the present invention, an a-1 ,2-mannosidase or a 

functional part thereof expressed in a methylotrophic yeast strain preferably is targeted 
to a site in the secretory pathway where Man 8 GlcNAc 2 (the substrate of a-1, 2- 



15 



WO 02/00856 



PCT/IBOl/01514 



10 



15 



mannosidase) is already formed on a glycoprotein, but has not reached a Golgi 
glycosyltransferase which elongates the sugar chain with additioAal mannose residues. 

Accordingly, in a preferred embodiment of the present invention, the cc- 
1,2-mannosidase expression vector is engineered as such that the a- 1 ,2-mannosidase 
or a functional part thereof expressed from the vector includes an ER-retention signal. 

"An ER retention signal" refers to a peptide sequence which directs a 
protein having such peptide sequence to be transported to and retained in the ER. 
Such ER retention sequences are often found in proteins that reside and function in the 



ER. 



Multiple choices of ER retention signals are available to those skilled in the 
art, e.g., the first 21 amino acid residues of the S. cerevisiae • ER protein MNS1 
(Martinet et al. Biotechnology Letters 20: 1 171-1 177, 1998). A preferred ER retention 
signal for use in the present invention is peptide HDEL (SEQ ID NO: 1). The HDEL 
peptide sequence, found in the C-terminus of a number of yeast proteins, acts as a 
retention/retrieval signal for the ER (Pelham EMBO J. 7: 913-918,' 1988). Proteins 

with an HDEL sequence are bound by a membrane-bound receptor (Erd2p) and then 
enter a retrograde transport pathway for return to the ER from the Golgi apparatus. 

According to the present invention, an ER retention signal can be placed 
anywhere in the protein sequence of an a-l,2-mannosidase, but preferably at the C- 
20 terminus of the a- 1,2-mannosidase. 

The a- 1 ,2-mannosidase for use in the present invention can be further 
modified, e.g., by insertion of an epitope tag to which antibodies are available, such as 
Myc, HA, FLAG and His6 tags well-known in the art. An epitope-tagged ix- 1 ,2- 
mannosidase can be conveniently purified, or monitored for both expression and 
25 intracellular localization. 

An ER retention signal and an epitope tag can be readily introduced into a 
protein of interest by inserting nucleotide sequences coding for such signal or tag into 
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the nucleotide sequence encoding the protein of interest, using any of the molecular 
biology techniques known in the art. 

In another preferred embodiment, the vectors of the present invent 
include a sequence coding for a glucosidase II or a functional part thereof and are 
5 capable of expressing the glucosidase II or the functional part in the methylotrophic 
yeast strain. 

It has been established that the initial N-linked oligosaccharide 
(Glc 3 Man 9 GlcNAc2), transferred in the ER onto a protein, is cleaved in the ER by 
specific glucosidases to remove the glucose residues, and by a mannosidase to remove 

10 one specific a-l,2-Iinked mannose. It has been observed by the present inventors that 
some recombinant proteins expressed in Pichia have residual glucose residues on the 
sugar moiety when such proteins leave the ER for the Golgi apparatus. The residual 
glucose molecules present on the sugar structure prevent the complete digestion of the 
sugar moiety by an a-l,2-mannosidase, and the introduction of an exogenous 

15 glucosidase can facilitate the removal of these glucose residues. 

According to the present invention, the nucleotide sequence encoding a 
glucosidase II can derive from any species. Glucosidase II genes have been cloned 
from a number of mammalian species including rat, mouse, pig and human. The 
glucosidase II protein from these mammalian species consists of an alpha and a beta 

20 subunit. The alpha subunit is about 1 10 kDa and contains the catalytic activity of the 
enzyme, while the beta subunit has a C-terminal HDEL ER-retention sequence and is 
believed to be important for the ER localization of the enzyme. The glucosidase II 
gene from 5*. cerevisiae has also been cloned (ORF YBR229c, located on chromosome 
II). This gene encodes a protein of about 1 10 kDa, which shows a high degree of 

25 homology to the mammalian alpha subunits. 

A preferred glucosidase II gene for use in the present vectors is from a 
fungal species such as Pichia pastoris and S. cerevisiae. An example of a fungal 
glucosidase II gene is the S. cerevisiae glucosidase II alpha subunit gene. 
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According to the present invention, the nucleotide sequence can also 
encode only a functional part of a glucosidase II. By "functional part" is meant a 
polypeptide fragment of a glucosidase II which substantially retains the enzymatic 
activity of the full-length protein. v By "substantially" is meant at least about 40%, or 
5 preferably, at least 50% or more of the enzymatic activity of the full-length 

glucosidase II is retained. Functional parts of a glucosidase II can be identified and 
made by those skilled in the art using a variety of techniques known in the art. 

In a preferred embodiment of the present invention, the glucosidase II 

protein is engineered to include an ER retention signal such that the protein expressed 
10 in a methylotrophic yeast strain is targeted to the ER and retains therein for function. 

ER retention signals are as described hereinabove, e.g., the HDEL peptide sequence. 

The glucosidase II for use in the present invention can be further modified, 

e.g., by insertion of an epitope tag to which antibodies are available, such as Myc, HA, 

FLAG, and His6 tag, which are well-known in the art. 
15 According to the present invention, the "knock-in" vectors can include 

either or both of an a- 1,2-mannosidase coding sequence and a glucosidase II coding 

sequence. 

Further according to the present' invention, the nucleotide sequence coding 
for the enzyme to be expressed (e.g., an a-l,2-manrtosidase or a functional part 
20 thereof, or a glucosidase II or a functional part thereof) can be placed in an operable 
linkage to a promoter and a 3' termination sequence. 

Promoters appropriate for expression of a protein in a. methylotrophic yeast 
can include both constitutive promoters and inducible promoters. Constitutive 
promoters include e.g., the Pichia pastoris glyceraldehyde-3-phosphate 
25 dehydrogenase promoter ("the GAP promoter"). Examples of inducible promoters 
include, e.g., the Pichia pastoris alcohol oxidase I promoter ("the AOXI promoter") 
(U.S. Patent No. 4,855,231), or the Pichia pastoris formaldehyde dehydrogenase 
promoter ("the FLD promoter") (Shen et al. Gene 216: 93-102, 1998). 
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3' termination sequences are sequences 3' to the stop codon of a structural 
gene which function to stabilize the mRNA transcription product of the gene to which 
the sequence is operably linked, such as sequences which elicit pplyadenylation. 3' 
termination sequences can be obtained from Pichia or other methylotrophic yeast. 
5 Examples of Pichia pastoris 3' termination sequences useful for the practice of the 
present invention include termination sequences from the AOX1 gene, p40 gene, HIS4 
gene and FLD1 gene. 

The vectors of the present invention preferably contain a selectable marker 
gene. The selectable marker may be any gene which confers a selectable phenotype 
10 upon a methylotrophic yeast strain and allows transformed cells to be identified and 
selected from untransformed cells. The selectable marker system may include an 
auxotrophic mutant methylotrophic yeast strain and a wild type gene which 
complements the host's defect. Examples of such systems include the Saccharomyces 
cerevisiae or Pichia pastoris HIS4 gene which may be used to complement his4 
1 5 Pichia strains, or the S. cerevisiae' or Pichia pastoris ARG4 gene which may be used to 
complement Pichia pastoris arg mutants. Other selectable marker genes which 
function in Pichia pastoris include the Zeo*gene, the G418 R gene, and the like. 

The vectors of the present invention can also include an autonomous 
replication sequence (ARS). For example, U.S. Patent No. 4,837,148 describes 
20 autonomous replication sequences which provide a suitable means for maintaining 
plasmids in Pichia pastoris. The disclosure of U.S. Patent No. 4,837,148 is 
incorporated herein by reference. 

The vectors can also contain selectable marker genes which function in 
bacteria, as well as sequences responsible for replication and extrachromosomal 
25 maintenance in bacteria. Examples of bacterial selectable marker genes include 
ampicillin resistance {Amp 9 ), tetracycline resistance {Tef), neomycin resistance, 
hygromycin resistance, and zeocin resistance (Zeo R ) genes. 



19 



WO 02/00856 



PCT/IB01/01514 



According to the present invention, the nucleotide sequence encoding the 
protein to be expressed in a methylotrophic yeast can be placed in an integrative 
vector or a replicative vector (such as a replicating circular plasmid). 

Integrative vectors are disclosed, e.g;, in U.S. Patent No. 4,882,279 which 
5 is incorporated herein by reference. Integrative vectors generally include a serially 
arranged sequence of at least a first insertable DNA fragment, a selectable marker 
gene, and a second insertable DNA fragment. The first and second insertable DNA 
fragments are each about 200 nucleotides in length and have nucleotide sequences 
which are homologous to portions of the genomic DNA of the species to be 
1 0 transformed. A nucleotide sequence containing a structural gene of interest for 
expression is inserted in this vector between the first and second insertable DNA 
fragments whether before or after the marker gene. Integrative vectors can be 
linearized prior to yeast transformation to facilitate the integration of the nucleotide 
sequence of interest into the host cell genome. 
15 Replicative and integrative vectors carrying either or both of an a- 1,2- 

mannosidase coding sequence or a glucosidase II coding sequence can be constructed 
by standard techniques known to one of ordinary skill in the art and found, for 
example, in Sambrook et al. (1989) in Molecular Cloning: A Laboratory Manual, or 
any of a myriad of laboratory manuals on recombinant DNA technology that are 
20 widely available. 

Preferred vectors of the present invention carrying an c^l^-mannosidase 
expression sequence include pGAPZMFManHDEL, pGAPZMFManMycHDEL, 
pP ICZBlVIFManMycHDEL, pGAPZmManHDEL, pGAPZmMycManHDEL, 
pPIC 9mMycManHDEL and pGAPZmMycManHDEL, which are further described in 
25 the Examples hereinbelow. 

Preferred vectors of the present invention carrying a glucosidase II 
expression sequence include pGAPZAGLSII, pPICZAGLSII, pAOX2ZAGLSII, 
pYPTIZAGLSU, pGAPADElglsII, pPICADElglsII, pAOX2ADElglsII, 
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pYPTIADElglsII, pGAPZAglsIIHDEL and pGAPADElglsIIHDEL, which are 
further described in the Examples hereinbelow. 

In another aspect, the present invention provides "knock-out" vectors 
which, when introduced into a methylotrophic yeast strain, inactivate or disrupt a gene 
thereby facilitating the reduction in the glycosylation of glycoproteins produced in the 
methylotrophic yeast strain. 

In one embodiment, the present invention provides a "knock-out" vector 
which, when introduced into a methylotrophic yeast strain, inactivates or disrupts the 
Ochl gene; 

The S. cerevisiae OCH1 gene has been cloned (Nakayama et al. EMBO J. 
11: 251 1-2519, 1992). It encodes a membrane bound a-l,6-mannosyltransferase, 
localized in the early Golgi complex, that is functional in the initiation of a-1,6- 
polymannose outer chain addition to the N-linked core oligosaccharide (Man 5 GlcNAc 2 
and Man 8 GlcNAc 2 ) (Nakanishi-Shindo et al. J. Biol Chern. 268: 26338-26345, 1993). 

A Pichia sequence has been described in Japanese Patent Application No. 
07145005 that encodes a protein highly homologous to the S. cere\>isiae OCHL For 
purpose of the present invention, this sequence is denoted herein as "the Pichia OCH1 
gene". Those skilled in the art can isolate the OCH1 genes from other methylotrophic 
yeasts using techniques well known in the art. 

According to the present invention, a disruption in the OCH1 gene of a 
methylotrophic yeast can result in either the production of an inactive protein product 
or no product. The disruption may take the form of an insertion of a heterologous 
DNA sequence into the coding sequence and/or the deletion of some or all of the 
coding sequence. Gene disruptions can be generated by homologous recombination 
essentially as described by Rothstein (in Methods in Enzymology, Wu et al., eds., vol 
101:202-211, 1983). 

To disrupt the Ochl gene by homologous recombination, an Ochl knock- 
out vector can be constructed in such a way to include a selectable marker gene. The 

selectable marker gene is operably linked, at both 5' and 3' end, to portions of the 

21 



' WO 02/00856 



PCT/IB01/01514 



Octil gene of sufficient length to mediate homologous recombination. The selectable 
marker can be one of any number of genes which either complement host cell 
auxotrophy or provide antibiotic resistance, including URA3, LEU2 and HIS3 genes. 
Other suitable selectable markers include the CAT gene, which confers 
5 chloramphenicol resistance on yeast cells, or the lacZ gene, which results in blue 
colonies due to the expression of active 0-galactosidase. Linearized DNA fragments 
of an Ochl knock-out vector are then introduced into host methylotrophic yeast cells 
using methods well known in the art. Integration of the linear fragments into the 
genome and the disruption of the Ochl gene can be determined based on the selection 
1 0 marker and can be verified by, for example, Southern Blot analysis. 

Alternatively, an Ochl knock-out vector can be constructed in such a way 
to include a portion of the Ochl gene to be disrupted, which portion is devoid of any 
Ochl promoter sequence and encodes none or an inactive fragment of the Ochl 
protein. By "an inactive fragment", it is meant a fragment of the Ochl protein which 
15 has, preferably, less than about 10% and most preferably, about 0% of the activity of 
the full-length OCH1 protein. Such portion of the OCH1 gene is inserted in a vector 
in such a way that no known promoter sequence is operably linked to the OCH1 
sequence, but that a stop codon and a transcription termination sequence are operably 
linked to the portion of the Ochl gene. This vector can be subsequently linearized in 
20 the portion of the OCH1 sequence and transformed into a methylotrophic yeast strain 
using any of the methods known in the art. By way of single homologous 
recombination, this linearized vector is then integrated in the OCH1 gene. Two Ochl 
sequences are produced in the chromosome as a result of the single homologous 
recombination. The first Ochl sequence is the portion Of the Ochl gene from the 
25 vector, which is now under control of the OCH1 promoter of the host methylotrophic 
yeast, yet cannot produce an active OCH1 protein as such Ochl sequence codes for no 
or an inactive fragment of the OCH1 protein, as described hereinabove. The second 
Ochl sequence is a full OCH1 coding sequence, but is not operably linked to any 
known promoter sequence and thus, no active messenger is expected to be formed for 
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synthesis of an active OCH1 protein. Preferably, an inactivating mutation is 
introduced in the OCH1 sequence, to the 5' end of the site of linearization of the vector 
and to the 3' end of the translation initiation codon of OCH1 . By "inactivating 
mutation" it is meant a mutation introducing a stop codon, a frameshift mutation or 
5 any other mutation causing a disruption of the reading frame. Such mutation can be 
introduced into an Ochl sequence using any of the site directed mutagenesis methods 
known in the art. Such inactivating mutation ensures that no functional OCH1 protein 
can be formed even if there exist some promoter sequences 5 1 to the Ochl sequence in 
the knock-out vector. 
10 A preferred Ochl knock-out vector of the present invention is 

pBLURAS'PpOCHl. 

If desired, either or both of a mannosidase expression sequence and a 
glucosidase expression sequence can be carried on the same plasmid used to disrupt 
the OCH1 gene to create a "knock-in-and-knock-out" vector. 
IS Additionally, any of the above-described vectors can further include a 

nucleotide sequence capable of expressing a glycoprotein of interest in a 
methylotrophic yeast strain. 

Another aspect of the present invention is directed to methods of modifying 
methylotrophic yeast strains to reduce glycosylation on proteins produced by the 
20 methylotrophic yeast strains. In accordance with the present methods, methylotrophic 
yeast strains are modified by transforming into these yeast strains with one or more, 
i.e., at least one, knock-in and/or knock-out vectors of the present invention as 
described herein above. 

Methylotrophic yeast strains which can be modified using the present 
25 methods include but are not limited to yeast capable of growth on methanol such as 
yeasts of the genera Candida, Hansemda, Torulopsis, and Pichia. A list of species 
which are exemplary of this class of yeasts can be found in C. Anthony (1982), The 
BiochemistiyofMethylotrophs, 269. Pichia pastor is, Pichia methanolica, Pichiq 
anomola, Hansenula polymorpha and Candida boidinii are examples of 
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methylotrophic yeasts useful in the practice of the present invention. Preferred 
methylotrophic yeasts are of the genus Pichia. Especially preferred are Pichia 
pastoris strains GS1 15 (NRRL Y-15851); GS 190 (NRRL Y-18014) disclosed in U.S. 
Patent No. 4,818,700; PPF1 (NRRL Y-18017) disclosed in U.S. Patent No. 4,812,405; 
5 PPY120H and yGC4; as well as strains derived therefrom. 

Methylotrophic yeast strains which can be modified using the present 
methods also include those methylotrophic yeast strains which have been genetically 
engineered to express one or more heterologous glycoproteins of interest. The 
glycosylation on the heterologous glycoproteins expressed from these previously 
1 0 engineered strains can be reduced by transforming such strains with one or more of the 
vectors of the present invention. 

The vectors of the present invention can be introduced into the cells of a 
methylotrophic yeast strain using known methods such as the spheroplast technique, 
described by Cregg et al. 1985, or the whole-cell lithium chloride yeast transformation 
15 system, Ko et al. Agric. Biol. Chem. 48:341, modified for use in Pichia as described in 
EP 3 12,934. Other published methods useful for transformation of the plasmids or 
linear vectors include U.S. Patent No. 4,929,555; Hinnen et al. Proc. Nat. Acad. Sci. 
USA 75:1929 (1978); Ito et al. J. Bacteriol. 153:163 (1983); U.S. Patent No. 
4,879,23 1; Sreekrishna et al. Gene 59: 115 (1987). Electroporation and PEG1000 
20 whole cell transformation procedures may also be used. Cregg and Russel Methods in 
Molecular Biology: Pichia Protocols, Chapter 3, Humana Press, Totowa, N.J., pp. 27- 
39(1998). 

Transformed yeast cells can be selected by using appropriate techniques 
including but not limited to culturing auxotrophic cells after transformation in the 
15 absence of the biochemical product required (due to the cell' s auxotrophy), selection 
for and detection of a new phenotype, or culturing in the presence of an antibiotic 
which is toxic to the yeast in the absence of a resistance gene contained in the 
transformants. Transformants can also be selected and/or verified by integration of the 
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expression cassette into the genome, which can be assessed by e.g., Southern Blot or 
PCR analysis. 

. In one embodiment, a methylptrophic yeast strain is transformed with a 
vector which includes a nucleotide sequence coding for an cc-l,2-mannosidase or a 
5 functional part thereof. The nucleotide sequence is capable of expressing the a- 1,2- 
mannosidase or the functional part in the methylotrophic yeast strain, and is, 
preferably, integrated into the genome of the methylotrophic yeast strain. 

The expression of an a- 1,2 -mannosidase introduced in a methylotrophic 
yeast strain can be verified both at the mRNA level, e.g., by Northern Blot analysis, 

10 and at the protein level, e.g., by Western Blot analysis. The intracellular localization 
of the protein can be analyzed by using a variety of techniques, including subcellular 
fractionation and immunofluorescence experiments. An ER localization of an a-1, 2- 
mannosidase can be determined by co-sedimentation of this enzyme with a known ER 
resident protein (e.g., Protein Disulfide Isomerase) in a subcellular fractionation 

15 experiment. An ER localization can also be determined by an immunofluorescence 
staining pattern characteristic of ER resident proteins, typically a perinuclear staining 
pattern. 

To confirm that an a-l,2-mannosidase or a functional part thereof 
expressed in a methylotrophic yeast strain has the expected mannose-trimming 

20 activity, both in vitro and in vivo assays can be employed. Typically, an in vitro assay 
involves digestion of an in vitro synthesized substrate, e.g., Man 8 GlcNAc 2 , with the 
enzyme expressed and purified from a methylotrophic yeast strain, and assessing the 
ability of such enzyme to trim MansGlcNAc 2 to, e.g., Man 5 GlcNAc 2 . In in vivo 
assays, the a-l 5 2-mannosidase or a part thereof is co-expressed in a methylotrophic 

25 yeast with a glycoprotein known to be glycosylated with N-glycans bearing terminal 
a-l,2-linked mannose residues in such yeast. The enzymatic activity of such an <x- 
1 ,2-mannosidase or a part thereof can be measured based on the reduction of the 
number of ct-l,2-linked mannose residues in the structures of the N-glycans of the 
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glycoprotein. In both in vitro and in vivo assays, the composition of a carbohydrate 
group can be determined using techniques that are well known in the art and are 
illustrated in the Examples hereinbelow. 

In another embodiment, a methylotrophic yeast strain is transformed with a 
5 vector which includes a nucleotide sequence coding for a glucosidase II or a functional 
part thereof. The nucleotide sequence is capable of expressing the glucosidase II or 
the functional part in the methylotrophic yeast strain, and is, preferably, integrated into 
the genome of the methylotrophic yeast strain. 

The enzymatic activity of a glucosidase II or a functional part thereof 
1 0 expressed in a transformed methylotrophic yeast strain can be assessed using a variety 
of assays. For example, methylotrophic yeast cells transformed with a sequence 
encoding a glucosidase II or a part thereof can be set to grow on solid medium 
containing a substrate of the glucosidase, e.g., 5-bromo-4-chloro-3-indolyl-a-D- 
glucopyranoside or 4-MU-a-D-Glc. When the enzyme is expressed by the Pichia and 
1.5 secreted extracellularly, the substrate is acted upon by the enzyme, giving rise to 
detectable signals around the colonies such as blue color or fluorescent glow. 
Alternatively, liquid culture medium containing the expressed protein molecules can 
be collected and incubated in test tubes with a substrate, e.g., p-nitrophenyl-ct-D- 
glucopyranoside. The enzymatic activity can be determined by measuring the specific 
20 product released. Moreover, in vivo assays can be employed, where a glucosidase II is 
co-expressed in yeast with a glycoprotein known to be N-glycosylated with glucose 
residues, e.g., influenza neuraminidase. The enzymatic activity of the glucosidase II 
can be measured based on the reduction of the glucose content in the sugar chain(s) of 
the glycoprotein. 

25 . In still another embodiment of the present invention, a methylotrophic 

yeast strain is transformed with an Ochl knock-out vector. As a result of the 
transformation and integration of the vector, the genomic Ochl gene in the yeast 
strains is disrupted. 
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In a further embodiment of the present invention, a methyl otrophic yeast 
strain is transformed with any combination of an a-l,2-mannosidase expression 
vector, a glucosidase II expression vector, and an Ochl knock-out vector. Such 
modification can be achieved by serial, consecutive transformations, i.e., introducing 
5 one vector at a time, on alternatively by co-transformation, i.e., introducing the vectors 
simultaneously. 

The modified methylotrophic yeast strains described herein above can be 
further modified if desired. For example, additional disruption of genes encoding any 
other Pichia mannosyltransferases can be made. Genes encoding mammalian 

10 enzymes can also be introduced to produce glycoproteins having hybrid- or complex- 
type N-glycans, if desired. 

Methylotrophic yeast strains which are modified by using the present 
methods, i.e., by transforming with one or more of the vectors of the present invention, 
form another embodiment of the present invention. 

15 It should be understood that certain aspects of the present invention, 

especially the introduction of an intracellularly expressed a-1 ,2-mannosidase activity, 
are also useful to obtain a reduced glycosylation of the O-linked glycans on 
glycoproteins produced in a methylotrophic yeast, as it is known in the art that these 
O-linked glycans consist mainly of a-l,2-linked mannose residues. O-linked glycans 

20 as used herein refers to carbohydrate structures linked to serine or threonine residues 
of glycoproteins. 

A further aspect of the invention is directed to methods of producing a 
glycoprotein with reduced glycosylation in a methylotrophic yeast, especially a 
glycoprotein heterologous to the methylotrophic yeast. 
25 "A glycoprotein" as used herein refers to a protein which, in 

methylotrophic yeasts, is either glycosylated on one or more asparagines residues or 
on one or more serine or threonine residues, or on both asparagines and serine or 
threonine residues. 
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The term "reduced glycosylation" refers to a reduced size of the 
carbohydrate moiety on the glycoprotein, particularly with fewer mannose residues, 
when the glycoprotein is expressed in a methylotrophic yeast strain which has been 
modified in accordance with the present invention, as compared to a wild type, 
5 unmodified strain of the methylotrophic yeast. 

In accordance with the present invention, the production of a glycoprotein 
of interest with reduced glycosylation can be achieved in a number of ways. A 
nucleotide sequence capable of expressing a glycoprotein can be introduced into a 
methylotrophic yeast strain which has been previously modified in accordance with 
1 0 the present invention, i.e., a strain transformed with one or more of the vectors of the 
present invention and capable of producing glycoproteins with reduced glycosylation. 
Alternatively, a methylotrophic yeast strain which has already been genetically 
engineered to express a glycoprotein can be transformed with one or more of the 
vectors of the present invention. Otherwise, if a methylotrophic yeast strain does not 
1 5 express a glycoprotein of interest, nor is the strain transformed with any of the vectors 
of the present invention, such yeast strain can be transformed, either consecutively or 
simultaneously, with both a nucleotide sequence capable of expressing the 
glycoprotein and one or more vectors of the present invention. Additionally, a 
methylotrophic yeast strain can be transformed with one or more of the present knock- 
20 in and/or knock-out vectors which also include a nucleotide sequence capable of 
expressing a glycoprotein in the methylotrophic yeast strain. 

The nucleotide sequence capable of expressing a glycoprotein in a 
methylotrophic yeast can be made to include from 5' to 3', a promoter, a sequence 
encoding the glycoprotein, and a 3' termination sequence. Promoters and 3' 
25 termination sequences which are suitable for expression of a glycoprotein can include 
any of those promoters and 3' termination sequences described hereinabove. 

The nucleotide sequence for expression of a glycoprotein can include 
additional sequences, e.g., signal sequences coding for transit peptides when secretion 
of a protein product is desired. Such sequences are widely known, readily available 
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and include Saccharomyces cerevisiae alpha mating factor prepro (amf), Pichia 
pastoris acid phosphatase (PHO 1 ) signal sequence and the like. 

The nucleotide sequence for expression of a glycoprotein can be placed on 
a replicative vector or an integrative vector. The choice and construction of such 
5 vectors are as described hereinabove. 

The nucleotide sequence capable of expressing a glycoprotein can be 
carried on the same replicative plasmid as a plasmid-borne oc-l,2-mannosidase or 
glucosidase II expression unit. Alternatively, the nucleotide sequence containing the 
glycoprotein coding sequence is carried on a separate plasmid or integrated into the 
10 host genome. 

Glycoproteins produced can be purified by conventional methods. 

Purification protocols can be determined by the nature of the specific protein to be 

purified. Such determination is within the ordinary level of skill in the art. For' 

example, the cell culture medium is separated from the cells and the protein secreted 

1 5 from the cells can be isolated from the medium by routine isolation techniques such as 

precipitation, immunoadsorption, fractionation or a variety of chromatographic 
methods. 

Glycoproteins which can be produced by the methods of the present 
invention include, e.g., Bacillus amyloliquefaciens a-amylase, S. cerevisiae invertase, 

20 Trypanosoma cruzi /ram-sialidase, HIV envelope protein, influenza virus A 

haemagglutinin, influenza neuraminidase, Bovine herpes virus type-1 glycoprotein D, 
human angiostatin, human B7-1, B7-2 and B-7 receptor CTLA-4, human tissue factor, 
growth factors (e.g., platelet-derived growth factor), tissue plasminogen activator, 
plasminogen activator inhibitor-I, urokinase, human lysosomal proteins such as o- 

25 galactosidase, plasminogen, thrombin, factor XIII and immunoglobulins. For 

additional useful glycoproteins which can be expressed in the genetically engineered 
Pichia strains of the present invention, see Bretthauer and Castellino, Biotechnol 
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Appl. Biochem. 30: 193-200 (1999), and Kukuruzinska et al. Ann Rev. Biochem. 56: 
915-44(1987). 

Glycoproteins produced by using the methods of the present invention, i.e 
glycoproteins with reduced glycosylate, are also part of the present invention. 
5 Still another aspect of the present invention provides kits which contain 

one or more of the knock-in vectors, knock-out vectors, or knock-in-and-knock-out 
vectors of the present invention described abo ve. More particularly, a kit of the 
present invention contains a vector capable of expressing an a-mannosidase I in a 
methylotrophic yeast, a vector capable of expressing a glucosidase II in a 
10 methylotrophic yeast, a vector capable of disrupting the Ochl gene in a 

methylotrophic yeast, a vector capable of expressing both a glucosidase II and an a- 
mannosidase, a vector a vector capable of disrupting the Ochl gene and capable of 
expressing either or both of a glucosidase II and an a-mannosidase, or any 
combinations thereof. 

15 The kit can also include a nucleotide sequence which encodes and is 

capable of expressing a heterologous glycoprotein of interest. Such nucleotide 
sequence can be provided in a separate vector or in the same vector which contains 
sequences for knocking-in or knocking out as described hereinabove. 

In addition, the kit can include a plasmid vector in which a nucleotide 
20 sequence encoding a heterologous protein of interest can be subsequently inserted for 
transformation into and expression in a methylotrophic yeast. Alternatively, the 
knock-in or knock-out vectors in the kits have convenient cloning sites for insertion of 
a nucleotide sequence encoding a heterologous protein of interest. 

The kit can also include a methylotrophic yeast strain which can be 
25 subsequently transformed with any of the knock-in, knock-out or knock-in-and-knock- 
out vectors described hereinabove. The kit can also include a methylotrophic yeast 
strain which has been transformed with one or more of the knock-in or knock-out 
vectors. Furthermore, the kit can include a methylotrophic yeast strain which has been 
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transformed with a nucleotide sequence encoding and capable of expressing a 
heterologous glycoprotein of interest. 

The present invention is further illustrated by the following examples. 
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Example 1 

Introduction of ot-l,2-Mannosidase to the ER-Golgi Border 



1.1 Plasmids 



Plasmid 


Promoter 


Enzyme 


Tag 


pGAPZMFManHDEL 


GAP 


r. reesei a-l,2-mannosidase 




pGAPZMFManMycHDEL 


GAP 


T. reesei a-l,2-mannosidase 


Myc 


pPICZBMFManMycHDEL 


AOX1 


T. reesei a-l,2-mannosidase 


Myc 


pGAPZMFmManHDEL 


GAP 


mouse mannosidase IB 
catalytic domain 




pGAPZMFmMycManHDEL 


GAP 


mouse mannosidase IB 
catalytic domain 


Myc 



5 

The Trichoderma reesei ct-l,2-mannosidase gene has been isolated and 



described by Maras et al. (J. Biotechnol 77;255-263, 2000). The sequence of this 
gene is available at NCBI Genbank under Accession No. AF212153. A construction 
fragment was generated by PCR using the pPIC9MFmanase plasmid (same as 

10 pPPlMFmdsl described by Maras et al. (2000)) as the template and using the 

following oligonucleotide primers: 5 -G ACTG GTTC C AATTG AC A A GC-3 1 (SEQ ID 
NO:2) and 5'-AGTCTAGATTACAACTCGTCGTGAGCAAGGTGGCCGCCCCG 
TCG-3' (SEQ ID NO:3). The resulting product contained the 3' end of the Pichia 
pastoris AOXI promoter, the prepro-signal sequence of the S. cerevisiae cc-mating 

15 factor, the open reading frame of the trichoderma reesei a-l,2-mannpsidase cloned in 
frame with the signal sequence, the coding sequence for HDEL, a stop codon and an 
Xba I restriction site. This fragment was digested with Eco RI and Xba I, removing 
the 5* sequences up to the mannosidase ORF, and then cloned into the vector 
pGAPZaA (Invitrogen, Baarn, The Netherlands) which had been digested with Eco RI 
20 mdXba I, thus restoring the fusion with the S. cerevisiae a-mating factor signal 
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sequence. The resulting plasmid was named pGAPZMFManHDEL and is graphically 
depicted in Figure 1. The ORF sequence of the MFManHDEL fusion in 
pGAPZMFManHDEL is set forth in SEQ ID NO: 14. 

In order to introduce the coding sequence for a c-Myc tag between the 
catalytic domain and the HDEL-signal, the 3' end of the ORF of T. reesei cc-1,2- 
mannosidase was PCR-amplified using a sense primer 5 1 - 
CCATTGAGGACGCATGCCGCGCC-3' (SEQ ID NO: 4) (containing an Sph I 
restriction site) and an antisense primer 

GTATCTAGATTACAACTCGTCGTGCAGATCCTCTTCTGAGATGAGTrTTTGT 
TCAGCAAGGTGGCCGCCCCGTCGTGATGATGAA (SEQ ID NO: 5) (containing 
the coding sequences of the c-Myc tag and the HDEL signal, followed by a stop codon 
and an Xba I restriction site). The resulting PCR product was digested with Sph I and 
Xba I, purified by agarose gel electrophoresis and inserted into pGAPZMFManHDEL 
which had been cut with the same restriction enzymes, resulting in plasmid 
pGAPZMFManMycHDEL. To put the ORF of pGAPZMFManMycHDEL under the 
control of the inducible AOXI promoter, the entire ORF was liberated from ' 
pGAPZMFManMycHDEL with Bst BI and Xba I, and cloned in pPICZB (Invitrogen, 
Baarn, The Netherlands), resulting in pPICZBMFManMycHDEL. 

Cloning of the mouse mannosidase IB catalytic domain with concomitant 
addition of the coding sequence for a C-terminal HDEL-tag was done by PCR on a 
mouse cDNA library (mRNA isolated from the L929 cell line induced with 
cycloheximide and mouse Tumor Necrosis Factor. Average insert length of the cDNA 
library was 2000 bp). The PCR oligonucleotide primers used were: 
5^AACTCGAGATGGACTCTTCAAAACACAAACGC3' (SEQ ID NO: 6) and 
5TTGCGGCCGCTTACAACTCGTCGTGTCGGACAGCAGGATTACCTGA3 , 
(SEQ ID NO: 7). The product contained a 5\ Xho I site and the coding sequence for C- 
terminal HDEL-site, followed by a stop codon and a Not I site at the 3' end. The 
product was cloned in pGAPZaA via the Xho I /Not I sites in the PCR product and the 

vector, resulting in an in frame fusion of the mouse mannosidase catalytic domain 
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with the S. cerevisiae cc-mating factor signal sequence. The sequence of the entire 
open reading frame generated is set forth in SEQ ID NO: 15. 



1.2 Yeast Transformation and Genomic Integration 



Table 2 


Parental strain 


DNA transformed 


GS115(his4) 


poAjrZ/iVLrManillJliJL 




pPIC9MFManHDEL 




pPIC9mManHDEL . 




p r I L, y mMycManHDEL 




pGAPZmManHDEL 




p G APZmMycManHDEL 


GS1 15 (his4 complemented by 


pGAPZMFManHDEL 


pPIC^InfluenzaHA) 






pGAPZmManHDEL 




p G APZmMycManHDEL 


PPY120H (his4 complemented by 


pGAPZMFManMycHDEL 


. pPIC9sOCHl) 






pPICZBMFManMycHDEL 


yGC4 (his4 argl ade2 ura3 


pPIC9InfluenzaNeuraminidase 


complemented by 




pBLURA5TpOCHl) 






pGAPZMFManHDEL 




pPIC9GJucosepxidase 



All transformations to Pichia pastoris were performed with electroporation 
according to the directions of Invitrogen. Transformants of vectors carrying the 
Zeocin resistance gene were selected on YPD containing 100 |ag/ml Zeocine 
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(Invitrogen, Baarn, the Netherlands) and 1M sorbitol. Selection of transformants of 
pPIC9 derivatives was done on minimal medium lacking histidine and containing 1M 
sorbitol. Genomic integration of the expression cassettes was verified using PCR on 
genomic DNA purified from the Pichia strains using the Yeast Miniprep method 
5 (Nucleon). In all cases concerning the Trichoderma reesei gene fusions, the primers 
used were the sense primer 5'-CCATTGAGGACGCATGCCGCGCC-.3' (SEQ ID NO: 
8), which annealed to the 3' half of the mannosidase ORF, and the antisense primer 3* 
AOXI 5 -GCAAATGGCATTCTGACATCCT-3' (SEQ ID NO: 9), which annealed to 
the AOXI transcription terminator that was present in all our expression constructs. 
10 For the control of genomic integration of the mouse mannosidase transgenes, PCR was 
done using the sense primer 5»GAP S'GTCCCTATTTCAATCAATTGAAS' (SEQ ID 
NO: 10, annealing to the GAP promoter or 5'AOXI 

S'GACTGGTTCCAATTGACAAGCS* (SEQ ID NO: 1 1), annealing to AOXI 
promoter), and the antisense primer 3'AOXI (above). For the expression constructs 
15 containing a Myc tagged Trichoderma reesei a-l,2-mannosidase expression unit, 
further evidence for genomic integration was obtained using Southern Blotting with 
the entire MFManMycHDEL ORF ( 32 P labelled using HighPrime, Boehringer 
Mannheim) as a probe. 

20 13 Expression of a-l,2-mannosidase 

Expression of an cc-l s 2-Mannosidase in GS1 15 strains expressing influenza 
Virus haemagglutinin was verified by qualitative Northern blot. Expression of an ct- 
1,2-Mannosidase in PPY120H strains was verified by anti-Myc Western blot. 

Qualitative Northern Blot — Total RNA was purified from ^/c/z/a strains 
25 and the yield was determined spectrophotometrically. Northern blotting was 

performed according to standard procedures and an estimate of the quantity of RNA 
loaded was made using methylene blue staining of the blot, visualizing the rRNA 
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bands. The blot was probed with a ClaVNarl fragment of the mannosidase, labelled 
with 32 P using HighPrirae (Boehringer Mannheim). 

SDS-PAGE and Western Blotting - Total yeast cell lysates were prepared 
by washing the cells twice with PBS, followed by boiling in 1 volume of 2x 
5 concentrated Laemmli loading buffer for 5 min. The lysate was spun briefly in a 
microcentrifuge prior to gel loading and only the supernatant was loaded. For the 
analysis of proteins secreted into the growth media, the proteins were precipitated 
from 200 pi of these media using desoxycholate/trichloroacetic acid according to 
standard procedures. The pellet was redissoived in 2x concentrated Laemmli loading 
1 0 buffer and the solutions were pH-corrected using Tris. SDS-PAGE was performed 
andfollowed by semidry electroblotting to nitrocellulose membranes. For Western 
Blotting, the 9E10 anti-Myc and the anti-HA mouse monoclonals (Boehringer 
Mannheim) were used at a concentration of lug/ml, and the rabbit anti-PDI antiserum 
(Stressgen) was used at a dilution of 1/500. The secondary antibodies were goat anti- 
1 5 mouse IgG conjugated to alkaline phosphatase for the monoclonals and goat anti- 
rabbit IgG conjugated to peroxidase for the polyclonal (secondary antibodies from 
Sigma). Detection was performed using the NBT/BCIP system for alkaline 
, phosphatase and the Renaissance substrate (NENBiosciences) for peroxidase. 

Imaging of the latter blot result was done on a Lumilager imaging device (Boehringer 
20 Mannheim). 

The results shown in Figure 4 indicated that the great majority of the 
HDEL-tagged protein was retained intracellularly, both when expressed from the 
strong constitutive GAP promoter and when expressed from the strong inducible 
AOXI promoter. 

25 '•• ' •- 

1.4 Localization of a-1.2-Mannosidase 

Isopycnic sucrose density gradient centrifugation - To determine the 
localization of the HDEL-tagged mannosidase, subcellular fractionation was carried 
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out using cells expressing the mannosidase-Myc-HDEL from the strong constitutive 
GAP promoter. 

Briefly, 0.5 g of wet weight yeast cells were lysed using 4x1 mih 
vortexing with 4.5 g glass beads in 1 ml lysis-buffer (50 fnM Tris-HCL pH 7.5 
5 containing 0.6 M sorbitol, 10 mM p-mercaptoethanol and 5. mM MgC^). Between 
vortexing periods, the mixture was placed on ice for 5 min. The supernatant was 
collected and the glass beads were washed once with lysis-buffer, and the supernatant 
of this washing step was added to the first supernatant. This lysate was subjected to a 
differential centrifugation procedure. The PI 0000 pellet was solubilized in 0.5 ml of a 

10 60% sucrose solution in lysis buffer. This solution was placed at the bottom of an 

Ultraclear ultracentrifuge tube (Beckman) of 14 x 89 mm. Subsequently, 1.5 ml each 
of sucrose solutions of 55, 50, 45, 42.5, 40, and 37.5% were carefully layered over 
each other. The tube was filled to the edge with 35% sucrose. Isopycnic sucrose 
gradient centrifugation was performed for 14 h at 180,000g in a Beckman SW 41 rotor 

15 in a Beckman Model L8-70 preparative ultracentrifuge. After completion. 1ml 
fractions were collected from the top and partially dialysed from excess sucrose, 
evaporated to dryness in a vacuum centrifuge. After redissolving the pellet in 
Laemmli buffer, the samples were subjected to SDS-PAGE in triplicate and the 
Western blots were treated with anti-HA. anti-Myc or anti-PDI ("PDI" for Protein 

20 Disulfide Isomerase), respectively. 

The results illustrated almost exact cosedimentation of the 
MFManMycHDEL protein with the Protein Disulfide Isomerase marker protein 
(which is also targeted with a HDEL signal sequence) (Figure 5). In the same assay, 
the HA -tagged OCH1 was distributed over the whole gradient, with the highest 

25 abundance in fractions having a density lower than that of the fractions containing the 
mannosidase and the PDI. This result indicated that the mannosidase was targeted to 
the expected location (the ER-Golgi boundary) by the addition of an HDEL signal. In 
contrast, the mannosidase without HDEL. expressed from inducible alcohol oxidase I 
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promoter (which was of comparable strength as the GAP promoter), was secreted at a 
high level of about 20 mg/1. 

Immunofluorescence microscopy - To confirm the correct targeting of the 
manriosidase-Myc-HDEL, an immunofluorescence microscopy experiment was 
performed. 

Briefly, yeast cultures were grown to OD 600 in YPD (for 
pGAPZMFManMycHDEL) or in YMP following a YPGlycerol growth phase for 
pPICZBMFManMycHDEL. Formaldehyde was added to the yeast cultures to a final 
concentration of 4% and incubated for 10 min at room temperature. Cells were 
pelleted and resuspended in 50mM potassium phosphate buffer pH 6,5 containing 
ImM MgCl 2 and 4% formaldehyde and incubated for 2h at room temperature. After 
pelleting, the cells were resuspended to an OD 60 o=10 in 100 mM potassium phosphate 
buffer pH 7.5 containing ImM MgCl 2 arid EDTA-free Complete™ protease inhibitor 
cocktail (Boehringer Mannheim). To 100 \i\ of cell suspension, 0.6 \xl of p-mercapto- 
ethanol and 20^1 of 20,000 U/ml Zymolyase 100T (ICN) were added, followed by a 
25 minute incubation with gentle shaking. The cells were washed twice in the 
incubation buffer and added to poly-lysine coated cover slips (these are prepared using 
adhesive rings normally in use for reinforcing perforations in paper). Excess liquid 
was blotted with a cotton swab and the cells were allowed to dry at 20°C. All 
blocking, antibody incubation and washing steps are performed in PBS containing 
0.05% bovine serum albumin. Primary antibodies are used at 2^ig/jil and secondary 
antibodies conjugated to flurophores (Molecular probes) were used at 5jig/jil. The 
nucleus was stained with the nucleic acid stain HOECHST 33258. After fixation and 
cell wall permeabilization, the integrity of the yeast cell morphology was checked in 
phase contrast microscopy and after immunostainiiig, the slides were examined under 
a Zeiss Axiophot fluroresensce microscope equipped with a Kodak digital camera. 
Images were processed using Macprobe 4.0 software and prepared with Corel 
Photopaint 9.0. 
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The Golgi marker protein OCH1-HA gave the typical Golgi staining 
pattern described in the literature (speckle-like staining). Staining with the 9E10 
monoclonal anti-Myc antibody, recognizing mannosidase-Myc-HDEL, gave a 
perinuclear staining pattern with some disparate staining in the cytoplasm, highly 
5 indicative for an ER targeting (Figure 4). 

Based on the foregoing experiments, it is concluded that the Trichoderma 
reesei mannosidase-Myc-HDEL was targeted to the ER-Golgi boundary. 

Example 2 
Co-expression of Mannosidase-HDEL 
10 with Recombinant Glycoproteins 

Co-expres sion of Mannosidase-HDEL with the Trypanosoma cruzi /r<my-SiaIidase 
The cloning of a Trypanosoma cruzi /nmy-sialidase gene coding for an 
active /nmy-sialidase member without the C-terminal repeat domain has been 

15 described by Laroy et al. {Protein Expression and Purification 20: 389, 2000) which is 
incorporated herein by reference. The sequence of this Trypanosoma cruzi trans- 
sialidase gene is available through NCBI Genbahk under the Accession No. 
AJ276679. For expression in P. pastoris, the entire gene was cloned in pHILD2 
(Invitrogen, San Diego, CA), creating pHILD2-TS. To allow better secretion, pPIC9- 

20 TS was created in which tamy-sialidase was linked to the prepro secretion signal of 
the yeas^t a-mating factor. Plasmids pPIC9-TSE and pCAGGS-prepro-TSE were 
created where the epitope E-tag was added to the C-terminal of the /ram'-sialidase to 
allow easy detection and purification. The construction of pHILD2-TS, pPIC9-TSE 
and pCAGGS-prepro-TSE has been described by Laroy et al. (2000), incorporated 

25 herein by reference. The vectors used in the construction were made available through 
http://www.belspo.be/bccm/lmbp.htm#main for pCAGGS (No. LMBP 2453), 
Invitrogen, San Diego, CA for pHILD2 and pPIC9, and Pharmacia Biotech for 
pCANTAB-5E. 
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Plasmid pPIC9-TSE was linearized with SstI and was transformed into P. 
pastoris GS1 15 (his4) strain by electroporation according to the manufacturer's 
instructions (Invitrogen). One of the transformants was further transformed with 

plasmid pGAPZMFManHDEL, establishing a strain co-expressmg Mannosidase- 
5 HDEL and the Trypanosoma cruzi /raws-sialidase. 

Fermentation and protein purification was according to the procedures 
described by Laroy et al. (2000). 

Purified //ww-sialidase was subject to carbohydrate analysis according to 
Callewaert et al, Glycobiology 11,4, 275-281, 2001. Briefly, the glycoproteins were 
1 0 bound to the PVDF membrane in the wells of a 96-well plate, reduced, alkylated and 
submitted to peptide-N-glycosidase F deglycosylation. The glycans were derivatised 
with 8-amino-l,3,6-pyrenetrisulfonic acid by reductive animation. Subsequently, the 
excess free label was removed using Sephadex Gl 0-packed spin columns and the 
glycans were analysed by electrophoresis on a 36 cm sequencing gel on an ABI 377A 
15 DNA -sequencer and detected using the built-in argon laser. Digests with 3 mU/ml 
purified T. reesei a-l,2-mannosidase (described by Maras et al., J. Biotechnol. 77, 
255-63, 2000) were also performed in 20 mM sodium acetate pH=5.0. The glycans 
derived from 1 jag of the purifed recombinant glycoproteins were used as the substrate. 
1U of the ct-l,2-mannosidase is defined as the amount of enzyme that releases 1 pmol 
20 of mannose from baker's yeast mannan per minute at 37°C and pH=5.0. 

As can be seen in Figure 6, panel B, the major N-glycan on franj-sialidase 
was Man 8 GlcNAc 2 (Compare with panel F, representing an analysis of the N-glycans 
of bovine RNAseB. The one but last peak in this profile is Man 8 GlcNAc 2 , the first 
peak is Man 5 GlcNAc 2 ). In vitro, this glycan was digestible to Man 5 GlcNAc 2 with a- 
25 1 ,2rmannosidase (Figure 6, panel C). In the N-glycan profile of the trans-sialidase 
co-expressed with mannosidase-HDEL, the major peak corresponded to 
Man 5 GlcNAc 2 (Figure 6, panel D). 
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Co-expression of Mannos idase-HDEL with the Influenza A virus haemagglutinin 
The Influenza A virus haemagglutinin was known to be glycosylated in 
Pichia pasioris - with high-mannose N-glycans containing 9-12 mannose residues 
(Saelens et al. Eur. J.Biochem. 260: 166-175; 1999). The effect of a co-expressed 
5 mannosidase on the N-glycans of the haemagglutinin was assessed in an N-glycan 
profiling method described below. In addition, to compare the efficiency of the 
Trichoderma enzyme (having a temperature optimum of 60°C) with a mammalian 
mannosidase having a temperature optimum of 37°C, the catalytic domain of the 
mouse mannosidase IB from a mouse cDNA -library was cloned and tagged with a 

10 HDEL signal by PCR amplification. This ORF was cloned after the prepro-signal 
sequence of the S. cerevisiae a-mating factor under the control of the GAP promoter. 
Expression of the mannosidase-HDEL transgenes on the mRNA level was confirmed 
by qualitative Northern blotting. 

The haemagglutinin was expressed and purified from a non-mannosidase 

15 expressing control strain and from a strains co-expressing the Trichoderma reesei 

mannosidase-HDEL or the mouse mannosidase IB-HDEL according to the procedure 
described by Kulakosky et al. Glycobiology 8: 741-745 (1998). The purified 
haemagglutin was subjected to PNGase F digestion as described by Saelens et al. Eur. 
J. Biochem. 260: 166-175, 1999. The proteins and glycans were precipitated with 3 

20 volumes of ice-cold acetone and the glycans were extracted from the pellet with 60% 
methanol. Following vacuum evaporation, the glycans were labeled with 8-amino- 
1,3,6 pyrenetrisulfonic acid by adding 1 ^1 of a 1:1 mixture of 20 mM APTS in 1.2M 
citric acid and 1M N a CNBH 3 in DMSO and incubating for 16h at 37°C at the bottom 
of a 250 \xl PCR-tube. The reaction was stopped by the addition of 10 ul deionized 

25 water and the mixture was loaded on a 1 .2 cm Sephadex G10 bed packed to dryness in 
a microspin-column by centrifugation in a swinging bucket rotor, which provided for a 
flat resin surface. After loading, 50 \i\ deionised water was carefully added to the 
resin bed and the spin column was briefly centrifuged for 5 seconds at 750g in a ' 
tabletop centrifuge. This elution process was repeated twice and all the eluates were 
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pooled and evaporated to dryness in a Speedvac vacuum centrifuge (Savant). The 
labeled glycans were reconstituted in 1.5 ul gel loading buffer containing 50% 
formamide and 0.5 ul Genescan 500™, labeled with rhodamine (Perkin Elmer 
Bioscience), serving as an internal reference standard. This mixture was loaded oh a 
DNA-sequencing gel containing 10% of a 19:1 mixture of acrylamide.bisacrylamide 
(Biorad, Hercules, CA, USA) and made up in the standard DNA-sequencing buffer 
(89 raM Tris, 89 mM borate, 2.2 mM EPTA). Polymerization of the gel was 
catalyzed by the addition of 200 pi 10% ammononiumpersulfate solution in water and 
20 pi TEMED. The gel was of the standard 36 cm well-to-read length and was run On 
an Applied Biosystems Model 373A DNA-sequencing apparatus. Prerunning of the 
gel was done at 1000 V for 15 min. and after loading, the gel was electrophoresed for 
8h at 1250 V without heating. This methodology gives a limit of detection of 10 fmol 
per peak. The data were analysed with Genescan 3.0 software. 

As shown in Figure 7, the Trichoderma reesei a- 1 ,2-mannosidase 
15 provided the most complete reduction in the number of a-l,2-mannoses present on the 
N-glycans. The N-glycan processing by mouse mannosidase IB-HDEL was less 
efficient than by the Trichoderma reesei a- 1 ,2-mannosidase. 

Despite the efficient removal of ct-1 ,2-mannoses from the N-glycans of 
haemagglutinin, no Mah 5 GIcNAc 2 was obtained. Even after digestion of the N- 
20 glycans with 3 mU of purified Trichoderma reesei a- 1 ,2-mannosidase, only 

Man 6 GlcNAc 2 was obtained as the smallest sugar chain. These results indicated that 
the remaining residues were possibly cc-l,6-linked mannoses, originating from the 
initiating OCH 1 a- 1 ,6-mannosyltransferase enzymatic activities. OCH 1 was observed 
to be localized to very early part of the Golgi apparatus and could act on the N-glycans 
25 of haemagglutinin before complete digestion of the Man 8 GlcNAc 2 precursor to 
Man 5 GlcNAc 2 by the mannosidases-HDEL. Thus, for proteins whose glycans are 
efficiently modified by the a-l,6-mannosyltransferase, an inactivation of the OCHl 
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gene coding for the transferase would be desirable in order to obtain proteins with 
Man 5 GlcNAc2. 
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Example 3 
Inactivation of the Pichia Ochl Gene: 

A Pichia pastoris sequence was found in the GenBank under Accession 
5 No. E12456 and was described in Japanese Patent Application No. 07145005, 

incozporated herein by reference. This sequence shows all typical features of an oc- 
1,6-mannosyltransferase and is most homologous to the S. cerevisiae OCHl, thus 
referred to herein as the Pichia pastoris Ochl gene. 

First, the full ORF of the Pichia pastoris Och 1 gene was PCR cloned in 
10 pUCl 8 to obtain plasmid pUC18pOchl. pUC18pOchl was cut with Hindlll, blunt- 
ended with T4 polymerase, then cut with Xbal, releasing a fragment containing the 5 ' 
part of the Pichia pastoris Ochl gene. This fragment was ligated into the vector 
pBLURA DC (available from the Keck Graduate Institute, Dr. James Cregg, 
http://www.kgi.edu/html/noncore/faculty/cregg/cregg.htm), which had been cut with 
15 Eco RI, blunt-ended with T4 polymerase, and then cut with Nhe I. This ligation 
generated pBLURA5'PpPCHl, as shown in Figure 8. 

Disruption of this Pichia OCHl gene in the Pichia genome was achieved 
, by single homologous recombination using pBLURA5TpOCHl, as illustrated in 
Figure 9. As a result of the single homologous recombination, the Ochl gene on the 
20 Pichia chromosome was replaced with two Ochl sequences: one consisted only about 
the first one third of the full Ochl ORF, the other had a ftill Ochl ORF without a Ochl 
promoter. Single homologous recombination was achieved as follows. Cellsx>fthe 
Pichia strain yGC4 were transformed by electroporation with pBLURAS'PpOCHl 
which had been linearized with the single cutter Bst BI. About 500 transformants 
25 were obtained on minimal medium containing 1M sorbitol, biotin, arginine, adenine 
and histidine and incubation at 27°C. Thirty-two of these transformants were picked 
and re-selected under the same conditions. Twelve clones were further analyzed for 
correct genomic integration of the cassette by PCR. Seven of the twelve URA 
prototrophic clones contained the cassette in the correct locus. 
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One of the Ochl -inactivated clones was also further transformed with 
pGAPZMFManHDEL to produce "supertransfprmants". Both the Ochl -inactivated 
clone and three supertransformants also expressing the ManHDEL were evaluated in 
cell wall glycan analysis as follows. Yeast cells were grown in 10 ml YPD to an 
5 00600=2 and mannoproteins were prepared by autoclaving the yeast cells in 20 mM 
sodium citrate buffer pH7 for 90 min at 120 °C and recovery of the supernatant after 
centrifugation. Proteins were precipitated from this supernatant with 3 volumes of 
cold methanol. The protein preparation obtained in this way was used for N-glycan 
analysis using DSA-FACE as described by Callewaert et al. (2001) Glycobiology 1 1, 

10 275-281. As shown in Figure 10. there was an increased amount of Man 8 GlcNAc 2 
glycan in the Ochl-inactivanted clone as compared to parent strain yGC4, indicative 
of a reduced activity of the Ochl enzyme. In all three supertransformants which also 
expressed the HDEL-tagged a- 1 ,2 mannosidase, the production of Man 5 GlcNAc 2 was 
observed. Furthermore, upon digestion of the same glycan mixtures with 3 mU/ml 

1 5 purified recombinant Trichoderma reesei ct-l,2-mannosidase, more Man 5 GlcNAc 2 
was formed in the strain transformed with pBLURAS'PpOCHl than in the parent 
strain (Figure 11, compare panel 2 and 3). 

These results confirmed that the lack of a production of Man 5 glycans on 
recombinantly produced proteins such as haemagglutinin from cells expressing cc-1,2- 

20 mannosidase were due to the activity of the Ochl protein. These results further 

indicate, that the production of glycoproteins with Man 5 glycans could be facilitated by 
the inactivation of the Och l gene. 
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Example 4 

Expression of Glucosidase II in Pichia pastoris 

4.1 Amplification of th e GLSII al p ha subunit ORF from S. cerevisiae. 
5 Genomic DNA was prepared from the S. cerevisiae strain INVS (a, leu2-3, 

112 his3Al, trpl-289, ura3-52), using the Nucleon kit (Amersham). A touch-down 
PCR reaction was performed using this genomic DNA as template and the LA 
TaKaRa polymerase (ImTec Diagnostics). The sequence of the PCR primers was 
based on the known sequence of the S. cerevisiae GLSII ORF: 
10 Sense primer: 5' CCG CTCGAG ATG GTC CTT TTG AAA TGGCTC 3' 

Xho I 

(SEQ ID NO: 12) 

Antisense primer: 5' CCGGGCCCA AAA ATA ACT TCC CAA TCT TCA 

Apa I 

15 G 3' (SEQ ID NO: 13) 

4.2 Cloning of the .y. cerevisiae glucosidase II ORF into Pichia pastoris exp ression 
vectors. 

Construction of the glucosidase II expression vectors - The PCR fragment 
20 was digested with Xho VApa I and ligated into the pGAPZA vector (Invitrogen), 
thereby placing the ORF under the transcriptional control of the GAP promoter. 
Using this strategy, the myc and the His6 tag were placed in frame to the C-terminus 
of Glucosidase II, creating pGAPZAGLSII. The complete ORF of pGAPZAGLSII 
was then sequenced to ensure that no mutations were generated in the PCR reaction. 
25 The sequence of the vector pGAPZAGLSII was set forth in SEQ ID NO: 1 8. The ••' 
GLSII ORF from the pGAPZAGLSII vector was cloned into yector pPICZA 
(Invitrogen) to create pPICZAGLSII, thereby placing the ORF under the 
transcriptional control of the AOXI promoter. The GLSII ORF from the 
pGAPZAGLSII vector was cloned into vector pAOX2ZA, thereby placing the ORF 
30 under the transcriptional control of the AOX2 promoter. This vector was created by 
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replacing the multi cloning site of vector pAOX2ZB with the multi cloning site of 
pPICZA. Vector pAOX2ZB was generated by replacing the AOX1 promotor of 
pPICZB by the AOX2 promotor region of the AOX2 gene (Martinet et al. 3 
Biotechnology Letters 21). The AOX2 promotor region was generated by PCR on 
5 Piehia genomic DNA with the sense primer 

S'GACGAGATCTTTTTTTCAGACCATATGACCGG 3' (SEQ ID NO: 26) and the 
antisense primer 5 * GCGGAATTCTTTTCTC AGTTGATTTGTTTGT 3' (SEQ ID 
NO: 27). The GLSII ORF from the pGAPZGLSII vector was cloned into vector 
pYPTlZA to create pYPTlZAGLSII, thereby placing the ORF under the 

10 transcriptional control of the YPT1 promoter. Vector pYPTZA was created by 
replacing the AOX1 promoter of pPICZA by the YPT1 promoter present on the 
plasmid pIB3 (GenBank accession number AF027960)(Sears et-al., Yeast 14, pg 783- 
790, 1998). All constructs contain the phleomycin resistance gene. The resulting final 
expression vectors (pGAPZAGLSII, pAOX2ZAGLSII, pPICZAGLSH and 

15 pYPTlZAGLSII) are depicted in Figures 12-15. 

Similar expression vectors were constructed, carrying the Ampicill in 
resistance marker and the Piehia ADE1 selection marker. In principle, the Zeocin 
resistance expression cassette of the plasmids pAOX2ZAGLSII, pGAPZAGLSII and 
pYPTlZAGLSII was replaced by the Ampicillin and Piehia ADE1 cassette of the 

20 vector pBLADE IX (Cregg, J.M.) to result in the vectors pAOX2ADElglsII, 

pGAPADElglsII and pYPTl ADElglsII. Vector pPICADElglsII was obtained by 
inserting the glucosidase II open reading frame into the multiple cloning site of the 
vector pBLADE IX (Cregg, J.M.). The resulting final expression veptors 
(pGAPADElglsII, pAOX2ADElglsII, pPICADElglsII and pYPTl ADElglsII) are 

25 depicted in Figures 16-20. 

Adding the ER retention tag HDEL to. Glucosidase II expression vectors — 

The following primers were used to generate an HDEL-containing PCR fragment: 

. Primer 1 : 5'GCG G GT CGA C/ CA C/GA C/GA A/CT G/TG A/GT TTT AGC CTT 

Sail HDEL stop 
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AGACATGAC3' (SEQIDNO:28) 
Primer 2: 5'CAG GAG CAAA GCT CGT ACG AG 3' (SEQ ID NO:29) 

Spll 

5 PCR was performed on pGAPZAGLSII with Taq pol., at 60°C. The PCR 

fragment of 225 bp was cut with Sal VSpl I and ligated into the Sal VSpl I opened 
pGAPZAGLSII vector, creating plasmid pGAPZAglsIIHDEL. The sequence of 
plasmid pGAPZAglsIIHDEL is set forth in SEQ ID NO: 24. The construction 
strategy and the resulting final expression vectors (pGAPZAglsIIHDEL and 
10 pGAPADE 1 glsIIHDEL) are depicted in Figures 20-21. 

4.3 Transformation of a Pichia pastoris strain. 

Transformation was performed using the conventional electroporation 
techniques, as described by Invitrogen. Cells of the Pichia pastoris strain PPY12-OH 
1 5 were transformed with pGAPZGLSII which had been cut with the single cutter Avr II. 
Transformants were selected based on their resistance to zeocin. 

Genomic analysis of the transformants — Genomic DNA was prepared 
from some zeocin resistant Pichia transformants. A PCR reaction was performed on 
the genomic DNA in order to determine whether or not the glucosidase II gene was 
20 integrated into the yeast genome. PCR was performed using Taq DNA polymerase 
(Boehinger) (2.5 mM MgCl 2 , 55°C for annealing). The primers were the same as the 
ones we used for the amplification of the ORF on S. cerevisiae genomic DNA. 
pGAPZAGLSII transformants were confirmed by the presence of a specific PCR 
product indicative of the glucosidase II ORF. 

25 • 

4.4 Expression and secretion of the S. cerevisiae glucosidase II alpha subunit in 

Pichia pastoris 

Analysis at the transcriptional level RNA was prepared from the 
transformants which scored positive after the genomic analysis. RNA was prepared 
30 using acid phenol. From each sample, 15 \xg of RNA was loaded on a formaldehyde 
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agarose gel. After electrophoresis the RNA was blotted on a Hybond N membrane. 
The membrane was hybridizing using a radioactive probe, which consists of a 344 bp 
glucosidase II specific fragment, corresponding to the 3' region of the glucosidase II 
ORF. No signals were detected with non-transformed control strains, whereas clear 
5 signals were observed with transformants. 

Analysis at the protein level using a double membrane assay — A 
nitrocellulose membrane was placed on a buffered dextrose medium (BMDY). On top 
of that nitrocellulose membrane, a cellulose acetate membrane was placed. Pichia 
transformants of pGAPZAGLSII were streaked on the cellulose acetate and grown for 

10 a few days. The yeast cells remained on the cellulose acetate, while the secreted 

proteins crossed this membrane. As such the secreted protein was captured onto the 
nitrocellulose membrane. After a few days the cellulose acetate, containing the yeast 
colonies, was removed. The nitrocellulose membrane was analyzed for the presence 
of glucosidase II using anti-myc antibody. Most of the transformants gave a clear 

15 signal as compared to a faint, hardly visible signal with the WT, non-transformed 
strain. 

Extracellular expression - PP Y 1 2-OH transformants of the construct 
pGAPZAGLSII(mychis6) (strains 12, 14 and 18) and transformants of the construct 
p G APZ A GLSII(myc)HDEL (strains HI, H2 and H3) were grown for 2 days on 2x10 

20 ml BMDY medium. These 6 transformants earlier scored positive both on the 

genomic level (PCR on gDNA) and on the RNA level (Northern blot). The culture 
medium was collected by centrifugation and concentrated with Vivaspin columns to 
about 1 ml. Proteins from this concentrate were precipitated with TCA, resuspended 
in Laemmli buffer and loaded for SDS-PAGE analysis. Proteins were blotted to 

25 nitrocellulose membrane. The blot was incubated overnight with anti-myc Ab. The 
secondary Ab was linked to peroxidase. Using the Renaissance luminiscence 
detection kit (NEK) and a light sensitive film (Kodak), a strong band at about 110 kDa 
was observed for the transformants 12, 14 and 18, indicating that GLSII was expressed 
and secreted from these transformants. No signal was obtained for the transformants 
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HI -3, which indicate that the HDEL tag, which was added C-terminally to the GLSII 
ORF, resulted in an ER localization of the protein, preventing GLSII to be secreted 
into the growth medium. . ~ 

Intracellular expression - The 6 transform ants and the WT strain were 
grown for 2 days in 500 ml BMDY. The cells were collected by centrifugation, 
washed, resuspended into a minimal volume (50 mM Tris.HCl pH 7.5, 5% glycerol) 
and broken using glass beads. The cell debris was removed through several 
centrifugation steps (low speed centrifugation (2000-3 000g)). Membranes were 
obtained from the supernatant through ultracentrifugation. The pellets were 
resuspended in Laemmli buffer and loaded for SDS-PAGE analysis. The proteins 
were blotted on a nitrocellulose membrane. The intracellular GLSII expression was 
checked using anti-myc Ab and peroxidase conjugated secondary Ab. Following the 
luminescence detection, a band at about 1 10 kDA was observed with the GLSIIHDEL 
tranformants (HI and H3, faint signal for H2), but not with the WT and GLSII 
expression strains. These results clearly indicate the intracellular presence of the 
recombinant GLSII when expressed with a C-terminal HDEL tag. No GLSII was 
detected intracellular^ when this tag was not present. 

4.5 Purification and activity assays of the recombinant glucosidase II alpha 
submit 

A GLSII assay was set up as follows and was tested using a commercially 
available yeast alpha-glucosidase (Sigma) as a positive control. 

Composition: 70 jal 80 mM phosphate-citrate buffer pH 6.8, 7 p.1 250 mM 
mannose, 3.5 jil 250 mM 2-deoxy-D-glucose, 0.8 4-MeUmbelliferyl-alpha-D- 
glucopyranoside (1 jaM). Three assays were performed: one with 1 unit commercial 
enzyme, one without the enzyme and one with the enzyme but without the substrate. 
The assay mixture was incubated overnight at 30°C. When illuminated with UV, only 
the reaction mixture with both the enzyme and the substrate showed fluorescence 
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(Figure 22). This indicates that the assay was very specific in detecting the activity of 
the alpha-glucosidase. 

WT PPY12-OH, strain 1 8 and strain H3 were grown during 2 days in 2x10 
ml growth medium. Cells were spun down and medium was adjusted to 300 mM 
5 NaCl and 1 0 mM imidazol and concentrated with Vivaspin columns to 0.5- lml. 
Medium was loaded onto a Ni-NTA spin column (Qiagen) and the purification was 
performed according to the manufactures recommendations. Protein was eluted from 
the column in 2x100 ^1 elution buffer (50 mMNaH 2 P0 4 , 300 mM NaCl, 250 mM 
imidazol pH 8.0). From each eluate, 20 \il was assayed for its glucosidase II activity. 

10 0.33 units of the commercial enzyme diluted in 20 pi of the elution buffer was used as 
a positive control. Fluorescence was observed with the positive control and the elute 
of strain 18, the strain which secreted the enzyme into the growth medium. These 
results indicate that the recombinant S. cerevisiae GLSII alpha subunit, secreted by 
Pichia pastoris, was a functionally active enzyme. The activity was not seen in the 

15 WT (untransformed) strain, nor in strain H3 as the GLSII was expressed intracellular^ 
(Figure 23). These results also indicate that the beta subunit is not necessary for the 
functionality of the alpha part of the protein. 
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Example 5 
Creating Pichia Strains Expressing 
both Glucosidase II and Mannosidase 

5 Strain GS 1 1 5 was transformed with pGAPZGLSH and pGAPZglsIIHDEL. 

Transformants were selected on YPDSzeo. 

Strain yGC4 was transformed with the following constructs, respectively: 
(1) pGAPADEglsII and pGAPADEglsHHDEL, selection on synthetic 
sorbitol medium without adenine; 

1 0 (2) p GAPZMFM anHDEL : selection on YPDSzeo; and 

(3) pGAPZMFManHDEL/pGAPADEglsIIHDEL: selection on synthetic 
sorbitol medium without adenine and with zeocin. 

Strain yGC4 with OCH1 knock-in and expressing MFmannosidaseHDEL 
was transformed with pGAPADEglsII and pGAPADEglsHHDEL. Selection of 
15 transformants was done on synthetic sorbitol medium without adenine and uracil. 

For all transformations, colonies were obtained. Transformants with the 
expression vector(s) integrated into the genome, determined by PGR, were obtained. 
Expression of GLSII from some of these transformants was observed. 
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SEQUENCE LISTING 



SEQ ID NO: 1 

HDEL (peptide) 

5 

SEQ ID NO: 2 

5 '-GACTGGTTCC AATTGAC AA GC-3 1 
SEQ ID NO: 3 

5'-AGTCTAGATTACAACTCGTCGTGAGCAAGGTGGCCGCCCCG TCG-3' 
10 SEQ ID NO: 4 

CCATTGAGGACGCATGCCGCGCC 

SEQ ID NO: 5 

gtatctagattacaactcgtcgtgcagatcctcttctgagatgagtttttgt 
15 tcagcaaggtggccgccccgtcgtgatgatgaa 

SEQ ID NO: 6 

AACTCGAGATGGACTCTTCAAAACACAAACGC 

20 SEQ ID NO: 7 

TTGCGGCCGCTTACAACTCGTCGTGTCGGACAGCAGGATTACCTGA 

SEQ ID NO: 8 

CCATTGAGGACGCATGCCGCGCC 

25 

SEQ ID NO: 9 

gcaaatgGcattctgacatcct 

SEQ ID NO: 10 

30 . GTCCCTATTTCAATCAATTGAA 

SEQ ID NO:ll 

GACTGGTTCCAATTGACAAGC 

35 SEQ ID NO:12 

CCG CTC GAG ATG GTC CTT TTG AAA TGG CTC 

SEQ ID NO: 13 

CCG GGC CCA AAA ATA ACT TCC CAA TCT TCAG 

40 
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10 



15 



SEQ ID NO: 14 

The ORF sequence of the MFManHDEL fusion in pGAPZMFManHDEL: 

fl/gagamccttcaattmactgctgttttattcgcagcatcctccgcattagctgctccagtcaacactacaacagaagatgaa 
acggcacaaattccggctgaagctgtcatcggttactcagatttagaaggggaWcgatgttgctgttttgccattttccaaca 
gcacaaataacgggttattgtttataaatactactattgccagcattgctgctaaagaagaaggggtatctctcgagaaaagag 
aggctgaagctgaattcgccacaaaacgtggatctcccaaccctacgagggcggcagcagtcaaggccgcattccagac 
gtcgtggaacgcttaccaccattttgcctttccccatgacgacctccacccggtcagcaacagctttgatgatgagagaaacg 
gctggggctcgtcggcaatcgatggcttggacacggctatcctcatgggggatgccgacattgtgaacacgatccttcagta 
tgtaccgcagatcaacttcaccacgactgcggttgccaaccaaggatcctccgtgttcgagaccaacattcggtacctcggt 
ggcctgctttctgcctatgacctgttgcgaggtcctttcagctccttggcgacaaaccagaccctggtaaacagccttctgag 
gcaggctcaaacactggccaacggcctcaaggttgcgttcaccactcccagcggtgtcccggaccctaccgtcttcttcaac 
cctactgtccggagaagtggtgcatctagcaacaacgtcgctgaaattggaagcctggtgctcgagtggacacggttgagc 
gacctgacgggaaacccgcagtatgcccagcttgcgcagaagggcgagtcgtatctcctgaatccaaagggaagcccgg 
aggcatggcctggcctgattggaacgtttgtcagcacgagcaacggtacctttcaggatagcagcggcagctggtccggc 
ctcatggacagcttctacgagtacctgatcaagatgtacctgtacgacccggttgcgtttgcacactacaaggatcgctgggt 
ccttggtgccgactcgaccattgggcatctcggctctcacccgtcgacgcgcaaggacttgacctttttgtcttcgtacaacg 
gacagtctacgtcgccaaactcaggacatttggccagttttggcggtggcaacttcatcttgggaggcattctcctgaacgag 
caaaagtacattgactttggaatcaagcttgccagctcgtactttggcacgtacacccagacggcttctggaatcggccccga 
20 aggcttcgcgtgggtggacagcgtgacgggcgccggcggctcgccgccctcgtcccagtccgggttctactcgtcggca 
ggattctgggtgacggcaccgtattacatcctgcggccggagacgctggagagcttgtactacgcataccgcgtcacggg 
cgactccaagtggcaggacctggcgtgggaagcgttgagtgccattgaggacgcatgccgcgccggcagcgcgtactcg 
tccatcaacgacgtgacgcaggccaacggcgggggtgcctctgacgatatggagagcttctggtttgccgaggcgctcaa 
gtatgcgtacctgatctttgcggaggagtcggatgtgcaggtgcaggccaccggcgggaacaaatttgtctttaacacgga 
25 ggcgcacccctttagcatccgttcatcatcacgaeggggcggccaccttgctcacgacgagttgtoa 

SEQ ID NO : 15 

The ORF sequence of the MFmManHDEL fusion in pGAPZMFmManHDEL: 

3 0 ^gagatttccttcaatttttactgctgttttattcgcagcatcctccgcattagctgctccagtcaacactacaacagaagatgaa 

acggcacaaattccggctgaagctgtcatcggttactcagatttagaaggggatttcgatgttgctgttttgccattttccaaca . 
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gcacaaataacgggttattgtttataaatactactattgccagcattgctgctaaagaagaaggggtatctctcgagatggact 
cttcaaaacacaaacgctttgatctgggcttagaagatgtgttaattcctcacgtagatgccggcaaaggagctaaaaacccc 
ggcgtcttcctgatccatggacccgacgaacacagacacagggaagaagaagagcgtctgagaaataagattagagctg 
accatgagaaagccctggaagaagcaaaagaaaaattaagaaagtcaagagaggaaatccgtgcagaaattcagacaga 
5 gaaaaacaaagtagcccaagcaatgaagacaaaagagaccagggtactgccgcctgtccctgtcccacaacgtgtaggg 
gtcagtggtggggatccagaagacatggagatcaagaagaaaagagacaaaattaaagagatgatgaaacatgcctggg 
ataattacagaacatacggatggggacataatgaactaaggcctattgcaaggaaaggccattccactaacatattcggaag 
ctcacagatgggtgccaccatagtggatgctttggataccctttatatcatggggcttcatgatgaattcatggatgggcaaag 
atggattgaagaaaaccttgatttcagtgtgaattcagaagtgtctgtctttgaagttaacattcgctttattggagggctcctcg 

10 ctgcatattacctgtcaggagaggaaatattcaagactaaagcagtgcagttggctgagaaactccttcctgcctttaacacac 
ctactgggattccctgggcaatggtgaacctgaaaagtggagtaggtcgaaactggggctgggcgtctgcaggcagcagc 
. atcctggctgagttcggcaccctgcacatggagtttgtgcacctcagctacttgaccggtgacttgacttactataataaggtc 
atgcacattcggaaactactgcagaaaatggaacgcccaaatggtctttatccaaattatttaaacccaagaacagggcgctg 
gggtcagtatcacacatcagttggtggtctgggagatagtttttatgaatacttactgaaagcatggctgacgtcagataaaac 

15 agaccacgaggcaagaaggatgtatgacgatgctgttgaggctatagaaaaacatGttattaagaagtcccgaggaggtct 
ggtttttattggagaatggaagaatggacacttggaaaggaagatggggcacttggcctgctttgctgggggaatgcttgcc 
cttggagcagatggttccagaaaggataaagctggccactacttagaactaggggcagaaattgcacgaacatgtcatgag 
tcatatgacagaactgcattgaaactaggtccggagtcattcaagtttgatggtgcagtggaagccgtggctgtgcggcagg 
ctgaaaagtattacatccttcgtccagaagtaattgaaacctattggtatctatggcgatttacccacgacccaagatacaggc 

20 agtggggctgggaagcagcactggctattgagaagtcgtgccgggtcagcggtgggttttctggtgtcaaggatgtatacg 

ccccgacccctgtgcatgacgacgtgcagcagagcttttctcttgctgaaacattaaaatacttgtacctgctgttctctggcga 

tgaccttctacctttagaccactgggtgtttaacacagaggcgcaccctctgccggtgttgcgcttagccaacagcactctttc 
. aggtaatcctgctgtccgacacgacgagttgtaa 

25 SEQ ID NO: 16 pAOX2ZAGLSII 

catggccaagttgaccagtgccgttccggtgctcaccgcgcgcgacgtcgccggagcggtcgagttctggaccgaccgg 
ctcgggttctcccgggacttcgtggaggacgacttcgccggtgtggtccgggacgacgtgaccctgttcatcagcgcggtc 
caggaccaggtggtgccggacaacaccctggcctgggtgtgggtgcgcggcctggacgagctgtacgccgagtggtcg 
gaggtcgtgtccacgaacttccgggacgcctccgggccggccatgaccgagatcggcgagcagccgtgggggcggga 
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15 



20 



25 



gttcgccctgcgcgacccggccggcaactgcgtgcacttcgtggccgaggagcaggactgacacgtccgacggcggcc 
cacgggtcccaggcctcggagatccgtccccctwccmgtcgatatcatgtaattagttatgtcacgcttacattcacgcc^ 
ccccccacatccgctctaaccgaaaaggaaggagttagacaacctgaag^ctaggtccctatttatt^ 
attaagaacgttatttatatttcaaattwcmtttttctgtacagacgcgtgtacgcatgtaac^^^^ 

5 Saaggtttlgggacgctcgaaggcbtaatttgcaagctggagaccaacatgtgagcaaaaggccagcaaaaggccagga 
accgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaa^tca 
gaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgac 
cctgccgcttaccggatacctgtccgccmctcccttcgggaagcgtggcgcmctcaatgctcacgctgtaggtatctca 
tcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaart 
10 atcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggt 
atgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaaggacagtaWg^^^ 
ctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtt 
tgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtgga 

acgaaaactcacgttaagggattttggtcatgagatcagatctttttttcagaccatatgaccggtccatcttctacggggggat 
tatctatgctttgacctctatcttgattctWatgattcaaatcactmacgttatttattacttactggrt^^ 
aaaaacaWactaaaaatcatacatcggcactctcaaacacgacagattgtg^^ 

gttgcacatttgagccagtaggctcctaatagaggttcgatacttattttgataatacgacatattgtcttacctctgaatgtgtca 

at ^tctctcgttcttcgtctcgtcagctaaaaatataacacttcgagtaagatacgcccaattgaaggctacgagataccaga 

ctatcactagtagaactttgacatctgctaaagcagatcaaatatccatttatccagaatcaattaccttcctttagcttgtcgalg 

gcatgaaaaagctacatgaaaatccccatccttgaagttttgtcagcttaaaggactccatttcctaaaatttcaagcagtcctc°t 

caactaaatttttttccattcctctgcacccagccctcttcatcaaccgtccagccttctcaaaagtccaatgtaagtagcctgca 

aattcaggttacaacccctcaattttccatccaagggcgatccttacaaagttaatatcgaacagcagagactaagcgagtcat 

catcaccacccaacgatggtgaaaaactttaagcatagattgatggagggtgtatggcacttggcggctgcattagagtttga 

aactatggggtaatacatcacatccggaactgatccgactccgagatcatatgcaaagcacgtgatgtaccccgtaaac^ 

tcggattatcgttgcaattcatcgtcftaaacagtacaagaaactttattcatgggtcattggactctgatgaggggcaca^ 

ccaatgattttttgggaaagaaagccgtaagaggacagttaagcgaaagagacaagacaacgaacagcaaaagtgacag 

ctgtcagctacctagtggacagttgggagtttccaattggttggttttgaattmacccatgttgagttgtccttgcttctcctt^ 

aacaatgcaagttgataagacatcaccttccaagataggctatttttgtcgcataaatttttgtctcggagtgaaaaccccttttat 

gtgaacagattacagaagcgtcctacccttcaccggttgagatggggagaaaattaagcgatgaggagacgattattggtat 
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aaaagaagcaaccaaaatcccttattgtccttttctgatcagcatcaaagaatattgtcttaaaacgggcttttaactacattgttc 
ttacacattgcaaacctcttccttctamcggatcaactgtattgactacattgatcttttttaacgaagmacgacttactaa 
ccacaaacaaatcaactgagaaaagaattcacgtggcccagccggccgtctcggatcggtacctcgagatggtccttttga 
aatggctcgtatgccaattggtcttctttaccgctt 

5 tttgccatagaaacagggtttatgcagaaaatattgccaaatctcatcactgctattacaaagtggacgccgagtctattgcac 
acgatcctttagagaatgtgcttcatgctaccataattaaaactataccaagattggagggcgatgatatagccgttcagttccc 
attctctctctcttmtacaggatcactcagtaaggttcactataaatgagaaagagagaatgccaaccaacagcagcggtttg 
ttgatctcttcacaacggttcaatgagacctggaagtacgcattcgacaagaaatttcaagaggaggcgaacaggaccagta 
ttccacaattccacttccttaagcaaaaacaaactgtgaactcattctggtcgaaaatatcttcatttttgtcactttcaaactccac 
10 tgcagacacafficatcttcgaaacggtgatgtato 

cgctgaaacttattgtaaacgagcaaaatttcctgaacattgaacatcatagaactaagcaggaaaacttcgcacacgtgctg 
ccagaagaaacaactttcaacatgtttaaggacaatttcttgtattcaaagcatgactctatgcctttggggcctgaatcggttg 
cgctagatttctctttcatgggttctactaatgtctacggtataccggaacatgcgacgtcgctaaggctgatggacacttcagg 
tggaaaggaaccctacaggcttttcaacgttgatgtctttgagtacaacatcggtaccagccaaccaatgtacggttcgatcc 

1 5 cattcatgttttcatcttcgtccacatctatcttttgggtcaatgcagctgacacttgggtagacataaagtatgacaccagta^ 
aataaaacgatgactcattggatctccgaaaatggtgtcatagatgtagtcatgtcectggggccagatattccaactatcattg 
acaaatttaccgatttgactggtagaccctttttaccgcccatttcctctatagggtaccatcaatgtagatggaattataatgatg 
agatggacgttctcacagtggactctcagatggatgctcatatgattccttacgattttatttggttggacttggagtatacgaac 
gacaaaaaatattttacttggaagcagcactcctttcccaatccaaaaaggctgttatccaaattaaaaaagttgggtagaaat 

20 cttgtcgtactaatcgatcctcatttaaagaaagattatgaaatcagtgacagggtaattaatgaaaatgtagcagtcaaggatc 
acaatggaaatgactatgtaggtcattgctggccaggtaattctatatggattgataccataagcaaatatggccaaaagattt 
ggaagtcctwtcgaacggmatggatctgccggctgamaactaatttattcatttggaatgatatgaacgagccttcgatttt 
cgatggcccagagaccacagctccaaaagatttgattcacgacaattacattgaggaaagatccgtccataacatatatggtc 
tatcagtgcatgaagctacttacgacgcaataaaatcgatttattcaccatccgataagcgtcctttccttctaacaagggcttttt 

25 ttgccggctctcaacgtactgctgccacatggactggtgacaatgtggccaattgggattacttaaagatttccattcctatggt 
tctgtcaaacaacattgctggtatgccatttataggagccgacatagctggctttgctgaggatcctacacctgaattgattgca 
cgttggtaccaagcgggcttatggtacccattttttagagcacacgcccatatagacaccaagagaagagaaccatacttatt 
caatgaacctttgaagtcgatagtacgtgatattatccaattgagatatttcctgctacctaccttatacaccatgtttcataaatca 

agtgtcactggatttccgataatgaatccaatgtttattgaacaccctgaatttgctgaattgtatcatatcgataaccaattttact 
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gg^aattcaggtctattagtcaaacctgtcacggagcctggtcaatcagaaacggaaatggttttcccacccggtatattct 
atgaattcgcatctttacactcttttataaacaatggtactgatttgatagaaaagaatatttctgcaccattggataaaattccatt 
atttattgaaggcggtcacattatcactatgaaagataagtatagaagatcttcaatgttaatgaaaaacgatccatatgtaatag 
ttatagcccctgataccgagggacgagccgttggagatctttatgttgatgatggagaaacttttggctaccaaagaggtgag 
5 tacgtagaaactcagttcattttcgaaaacaataccttaaaaaatgttcgaagtcatattcccgagaatttgacaggcattcacc 
acaatactttgaggaataccaatattgaaaaaatcattatcgcaaagaataatttacaacacaacataacgttgaaagacagta 
ttaaagtcaaaaaaaatggcgaagaaagttcattgccgactagatcgtcatatgagaatgataataagatcaccattcttaacc 
tatcgcttgacataactgaagattgggaagttatttttgggcccgaaGaaaaactcatctcagaagaggatctgaatagcgcc 
gtcgaccatcatcatcatcatcattgagttttagccttagacatgactgttcctcagttcaagttgggcacttacgagaagaccg 

10 gtcttgctagattctaatcaagaggatgtcagaatgccatttgcctgagagatgcaggcttcatttttgatacttttttatttgtaac 
ctatatagtataggatttWttgtcattttgmcttctcgtacgagcttgctcctgatcagcctatctcgcagctgatgaatatcttgt 
ggtaggggtttgggaaaatcattcgagtttgatgtttttcttggtatttcccactcctcttcagagtacagaagattaagtgagac 
cttcgtttgtgcggatcccccacacaccatagcttcaaaatgtttctactccttttttactcttccagattttctcggactccgcgca 
tcgccgtaccacttcaaaacacccaagcacagcatactaaattttccctctttcttcctctagggtgtcgttaattacccgtacta 

15 aaggmggaaaagaaaaaagagaccgcctcgtttcttWcttcgtcgaaaaaggcaataaaaatttttatcacgtttctttttctt 

gaaatttttttWtagtttttttctctttcagtgacctccattgatatttaagttaataaacggtcttcaatttctcaagWcagmcaW 

ttcttgttctattacaactttttttacttcttgttcattagaaagaaagcatagcaatctaatctaagggcggtgttgacaattaatcat 
cggcatagtatatcggcatagtataatacgacaaggtgaggaactaaac 

20 SEQIDNO:17 pAOX2ADElglsII 

tcgaccggctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctcac 
tgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatca 
ggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggc 
gtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggact 
25 ataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccg 
cctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagct 
gggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaaga 
cacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttg 
aagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaag 
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agttggtagctcttgatccggcaaacaaaccaccgetggtagcggtg gLLLLLLLgLLl gcaagcagcagattacgcgcagaa 
aaaaaggatctcaagaagatccmgatcttttctacg'gggtctgacgctcagtggaacgaaaactcacgttaagggattttgg 
tcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttltaaatcaatctaaagtatatatgagtaaa 
cttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctamcgttcatccatagttgcc^ 
5 ccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctcac 
cggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctcca 
tccagtctattaattgttgccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgccattgctacagg 
catcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgatcccccat 
gttgtgcaaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaagttggccgcagtgttatcactcatggttatg 

10 gcagcactgcataattctcttactgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattctga 
atagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtg 
ctcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatgtaacccactcgt 
gcacccaactgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaaa 
gggaataagggcgacacggaaatgttgaatactcatactcttcctttttcaatagctccaaggcaacaaattgactactcagac 

15 cgacattcattcgttattgattttaaatcaacgataaacggaatggttacttgaatgatttcactttatgatcattgtttactaattacc 
taaataggatmatatggaattggaagaataagggaaamcagatgtctgaa'aaaggcgaggagggtactaatcattcaag 
cccamcttgccagtaattgcttcataagcttcaatatactmctttactcttgatagcaatttctgcatccatggctacgccctctt 
tgccattcaatccgttggccgtcaaccaatctctgagaaactgcttatcgtaactctcttgcgatttacccacttggtaagtctttt 
gattccaaaatctagaagaatctggagttaaaacttcatctactagtaccaattcattgttttcgtccagtccaaattcgaatttcgt 

20 atcagcaataatgatccccttcaaaagggcgaagttttttgcagcagaatacaactcgaccgccttgacagcgaccttctcac 
aaatgtctttacctacaatctcagcagcttgttcaatagagatgttttcatcgtgttcaccctgttcagctttcgttgaaggtgtgaa 
aatcggagttggaaaggcgtcgctctcttgaaggttctcgmtcaaccttgactccatggacagtttttgagttcttgtactctttc 
catgcacttccagtgatgtaacctctgacaatggcttccaaaggtatcagtctgtgcttttttactatcaaggatcgtccctctaat 
tgagamgtatttttcttcagacagttttgatggtagtaaagcaaagacttccttgtcattagaagcaaccaaatgattcmatg^ 

25 gg^gccaaaaaatGaaaccagaaaactgagagctgagtcaaaatctttcccttatcaggaataccgtttgtcataatcacatc 

gtaagcggagatacggtcagttgcgacgaacagcaagttgttctcatcgactgcataaatgtctctaacctttcctttggcgatt 

aaaggtaggattccgtccagatcagtgttcacaatggacatacttggaaggatacagcaaagtgtgttggaagcgatgacac 

atggaaaggaatttttcgagtttcctagagtagtatattggggcggtgaaagttcagatgtttaatgcttaatactcttatactcttc 

aaagcgcccaagtgtttctgccaacctgacttttttctgaataatgaatcgttcaagtggagtatttaaaccatgattaagttacgt 
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gatttggcactggataaggtcgaaaaatatccgtattcataaacgattattggtaaaagttacaaaataccactaattacggag 

aagcttagtaacagttatcatctcttggtcgattaacgcttacaatttccattcgccattcaggctgcgcaactgtt^ 
cgatcggjgcgggcctcttcgctattacgc^^ 

ggagtgggaaataccaagaaaaacatcaaactcgaatgattttcccaaacccctaccacaagatattcatcagctgcgagat 
5 aggctgatcaggagcaagctcgtacgagaagaaacaaaatgacaaaaaaaatcctatactatataggttacaaataaaala 
gtatcaaaaatgaagcctgcatctctcaggcaaatggcattctgacatcctcttgattagaatctagcaagaccggtcttctcgt 
aagtgcccaacttgaactgaggaacagtcatgtctaaggctaaaactcaatgatgatgatgatgatggtcga^ 

agatcctcttctgagatgagttmgttcgggcccaaaaataacttcccaatcttcagttatgtcaagcgataggttaagaatggt 
gatcttattatcattctcatatgacgatctagtcggcaatgaacmcttcgccatttttmgacwaatac^ 

10 tgtgttgtaaattattctttgcgataatgattttttcaatattggtattcctcaaagtattgtggtgaatgcctgtcaaattctcgggaa 
tatgacttcgaacatttWaaggtattgtmcgaaaatgaactgagtttctacgtactcacctcmggtag^ 
tcatcaacataaagatctccaacggctcgtccctcggtat^ggggctataactattacatatggatcgtttttcattaac^^^ 
agatcttctatacttatctttcatagtgataatgtgaccgccttcaataaataatggaattttatccaatggtgcagaaatattctttt 
ctatcaaatcagtaccattgtttataaaagagtgtaaagatgcgaattcatagaatataccgggtgggaaaaccatttccgtttct 

15 gattgaccaggctccgtgacaggtttgactaatagacctgaattactccagtaaaattggttatcgatatgatacaattcagcaa 
attcagggtgttcaataaacattggattcattatcggaaatccagtgacacttgatttatgaaacatggtgtataaggtaggtag 
caggaaatatctcaattggataatatcacgtactatcgacttcaaaggttcattgaataagtatggttctettctcttggtgtctata 
tgggcgtgtgctctaaaaaatgggtaccataagcccgcttggtaccaacgtgcaatcaattcaggtgtaggatcctcagcaa 
agccagctatgtcggctcctataaatggGataccagcaatgttgtttgacagaaccataggaatggaaatctttaagtaatccc 

>0 aattggccacattgtcaccagtccatgtggcagcagtacgttgagagccggcaaaaaaagcccttgttagaaggaaaggac 

gcttatcggatggtgaataaatcgattttattgcgtcgtaagtagcttcatgcactgatagaccatatatgttatggacggatcttt 

cctcaatgtaattgtcgtgaatcaaatcttttggagctgtggtctctgggccatcgaaaatcgaaggctcgttcatatcattccaa 

atgaataaattagttaaatcagccggcagatccataaaccgttcgaaaaaggacttccaaatcttttggccatatttgcttatggt 
atcaatecatatagaattacctggccageaa^^ 

5 ctgtcactgatttcataatctttctttaaatgaggatcgattagtacgacaagatttctacccaacttttttaatttggataacagcct 

ttttggattgggaaaggagtgctgcttccaagtaaaatattttttgtcgttcgtatactccaagtccaaccaaataaaatcgtaag 

gaatcatatgagcatccatctgagagtccactgtgagaacgtccatctcatcattataattccatctacattgatggtaccctata 

gaggaaatgggcggtaaaaagggtctaccagtcaaatcggtaaatttgtcaatgatagttggaatatctggccccagggaca 

tgactacatctatgacaccattttcggagatccaatgagtcatcgttttatttttactggtgtcatactttatgtctacccaagtgtca 
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gctgcattgacccaaaagatagatgtggacgaagatgaaaacatgaatgggatcgaaccgtacattggttggctggtaccg 

atgttgtactcaaagacatcaacgttgaaaagcctgtagggttccmccacctgaagtgtccatcagccttagcgacgtc 

tgttccggtataccgtagacattagtagaacccatgaaagagaaatctagcgcaaccgattcaggccccaaaggcatagag 

tcatgcmgaatacaagaaattgtccttaaacatgttgaaagttgmcttctggcagcacgtgtgcgaagtm 

5 tatgatgttcaatgttcaggaaatmgctcgtttacaataagmcagcgcatWgccagtaaacmcaatt 

aaagatttctacggatacatcaccgtttcgaagatgaaatgtgtctgcagtggagtttgaaagtgacaaaaatgaagatattttc 
gaccagaatgagttcacagtttgtttttgcttaaggaa 

cgaatgcgfacttccaggtctcattgaaccgtt^^ 

atagtgaaccttactgagtgatcctgtaaaaaagagagagagaatgggaactgaacggctatatcatcgccctccaatcttg 

10 gtatagttttaattatggtagcatgaagcacattctctaaaggatcgtgtgcaatagactcggcgtccacmgtaatagcagt^ 
atgagatttggcaatattttctgcataaaccctgtttctatggcaaaacccagattgcgcacacttctttaatagatagtcggtaa 
acgcatgcgaaaaagcggtaaagaagaccaattggcatacgagccatttcaaaaggaccatctcgaggtaccgatccgag 
acggccggctgggccacgtgaattctmctcagttgatttgWgtggggamagtaagtcgtaaacttcgttaaaaaagatc 
atgtagtcaatacagttgatccgaaatagaaggaagaggtttgcaatgtgtaagaacaatgtagttaaaagcccgttttaagac 

1 5 aatattcmgatgctgatcagaaaaggacaataagggatmggttgcttcttttataccaataatcgtctcctcatcgcttaatttt 
ctccccatctcaaccggtgaagggtaggacgcttctgtaatctgttcacataaaaggggttttcactccgagacaaaaatttat 
gcgacaaaaatagcctatcttggaaggtgatgtcttatcaacttgcattgtttgcaaggagaagcaaggacaactcaacatgg 
gtaaaaattcaaaaccaaccaattggaaactcccaactgtccactaggtagctgacagctgtcactmgctgttcgttgtcttgt 
ctcWcgcttaactgtcctcttacggctttctttcccaaaaaatcattggggaaatgtgcccctcatcagagtccaatgacccat 

20 gaataaagtttcttgtactgtttaagacgatgaattgcaacgataatccgagcagtttacggggtacatcacgtgctttgcatat 
gatctcggagtcggatcagttccggatgtgatgtattaccccatagtttcaaactctaatgcagccgccaagtgccatacacc 
ctccatcaatctatgcttaaagttWcaccatcgttgggtggtgatgatgactcgcttagtctctgctgttcgatattaactttg^ 
ggatcgcccttggatggaaaattgaggggttgtaacctgaatttgcaggctacttacattggacttttgagaaggctggacgg 
ttgatgaagagggctgggtgcagaggaatggaaaaaaatttagttgagaggactgcttgaaattttaggaaatggagtccttt 

25 aagctgacaaaacttcaaggatggggattttcatgtagcmttcatgcGttcgacaagctaaaggaaggtaattgattctggat 
aaatggatatttgatctgctttagcagatgtcaaagttctactagtgatagtctggtatctcgtagccttcaattgggcgtatctta 
ctcgaagtgttatatttttagctgacgagacgaagaacgagagagtattgacacattcagaggtaagacaatatgtcgtattat 
caaaataagtatcgaacctctattaggagcctactggctcaaatgtgcaaccttagtggtgattgtctctgcttcttgatcacaat 
ctgtcgtgmgagagtgccgatgtatgatttttagtaaatgtt^ 
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^cgtaaaagtgatttgaatcataaaagaatcaagatagaggtcaaagcatagataatccccccgtagaagatggaccggtc 
atatggtctgaaaaaaagatctgatctcatg 

SEQ ID NO: 18 pGAPZAGLSII 
5 tegagatggte^gaaatggctcgtatgccaattggtcttetttaccgc^ 

gtgtgcgcaatctgggttttgccatagaaacagggtttatgcagaaaatattgccaaatctcatcactgctattacaaa^ 

cgccgagtctattgcacacgatcctttdgagaatgtgcttcatgctaccataattaaaactataccaagattggagggcgatga 
tata gccgttcagttcc^^ 

ccaacagcagcggtttgttgatctcttcacaacggttcaatgagacctggaagtacgcattcgacaagaaatttcaagagga 

10 ggcgaacaggaccagtattccacaattccacttccttaagcaaaaacaaactgtgaactcattctggtcgaaaatatcttcattt 

ttgtcactttcaaactccactgcagacacatttcatcttcgaaacggtgatgtatccgtagaaatctttgctgaaccttttcaattg 

aaagtttactggcaaaatgcgctgaaacttattgtaaacgagcaaaatttcctgaacattgaacatcatagaactaagcagga 

aaacttcgcacacgtgctgccagaagaaacaactttcaacatgtttaaggacaatttcttgtattcaaagcatgactctat g rctt 

tggggcctgaatcggttgcgctagamctctttcatgggttctactaatgtctacggtataccggaacatgcgacgtcgc^ 
15 g ct g at gg a ^ttcaggtggaaaggaaccctaca^^^ 

caatgtacggttcgatcccattcatgttttcatcttcgtccacatctatctmgggtcaatgcagctgacacttggg^ 
agtatgacaccagtaaaaataaaacgatgactcattggatctccgaaaatggtgtcatagatgtagtcatgtccctggggcca 
gatattccaactatcattgacaaatttaccgatttgactggtagaccctttttaccgcccatttcctctatagggtaccatcaatgta 
gatggaattataatgatgagatggacgttctcacagtggactctcagatggatgctcatatgattccttacgattttatttggttgg 
10 acttggagtatacgaacgacaaaaaatattttacttggaagcagcactcctttcccaatccaaaaaggctgttatccaaattala 
aaagttgggtagaaatcttgtcgtactaatcgatcctcatttaaagaaagattatgaaatcagtgacagggtaattaatgaaaat 
gtagcagtcaaggatcacaatggaaatgactatgtaggtcattgctggccaggtaattctatatggattgataccataagcaaa 
tatggccaaaagatttggaagtcctttttcgaacggtttatggatctgccggctgatttaactaatttattcatttggaatgatatga 
acgagccttcgattttcgatggcccagagaccacagctccaaaagatttgattcacgacaattacattgaggaaagatccgtc 
5 cataacatatatggtctatcagtgcatgaagctacttacgacgcaataaaatcgamattcaccatccgataagcgtc 

tctaacaagggctttttttgccggctctcaacgtactgctgccacatggactggtgacaatgtggccaattgggattacttaaag 
atttccattcctatggttctgtcaaacaacattgctggtatgccatttataggagccgacatagctggctttgctgaggatcctac 
acctgaattgattgcacgttggtaccaagcgggcttatggtacccattttttagagcacacgcccatatagacaccaagagaa 
gagaaccatacttattcaatgaacctttgaagtcgatagtacgtgatattatccaattgagatatttcctgctacctaccttataca 

62 



WO 02/00856 



PCT/TBOl/01514 



ccatgmcataaatcaagtgtcactggamccgataatgaatcc^^ 

cgataaccaatWactggagtaattcaggtctattagtcaaa 

ccacccggtatattctatgaattcgcatcmac 

tggataaaattccattatttattgaaggcggtcacattatcactatgaaagataagtatagaagatcttcaatgttaatgaaaaac 
5 gatccatatgtaatagttatagcccctgata^^ 

accaaagaggtgagtacgtagaaactcagttcattttcgaaaacaataccttaaaaaatgttcgaagtcatattcccgagaattt 

gacaggcattcaccacaatactttgaggaataccaatattgaaaaaatcattatcgcaaagaataatttacaacacaacataac 

gttgaaagacagtattaaagtcaaaaaaaatggcgaagaaagttcattgccgactagatcgtcatatgagaatgataataaga 
tcaccattcttaacctatcgcttgacataactgaagatt^^ 

10 atctgaatagcgccgtcgaccatcatcatcatcatcattgagttttagccttagacatgactgttcctcagttcaagttgggcact 

tacgagaagaccggtcttgctagattctaatcaagaggatgtcagaatgccatttgcctgagagatgcaggcttcatttttgata 
cmtttamgtaacctatatagtataggattt^ 

gatgaatatcttgtggtaggggmgggaaaatcattcgagtttgatgtttttcttggtamcccactcctcttc^ 

gattaagtgagaccttcg^gtgcggatcccccacacaccatagcttcaaaatgtttctactcctttWactcttccagatttt 
1 5 ggactccgcgcatcgccgtaccacttcaaaacac^^ 

attacccgtactaaaggtttggaaaagaaaaaagagaccgcctcgmcttWcttcgtcgaaaaaggcaataaaaattttt 

acgtttcmttcttgaaattlUtUWagttttmctcmcagtgacctccattgatamaagttaataaacggtcttcaamctca^ 
gtttcagmcattmcttgttctattacaac^ 

tgacaattaatcatcggcatagtatatcggcatagtataatacgacaaggtgaggaactaaaccatggccaagttgaccagtg 

20 ccgttccggtgctcaccgcgcgcgacgtcgccggagcggtcgagttctggaccgaccggctcgggttctcccgggacttc 

gtggaggacgacttcgccggtgtggtccgggacgacgtgaccctgttcatcagcgcggtccaggaccaggtggtgccgg 

acaacaccctggcctgggtgtgggtgcgcggcctggacgagctgtacgccgagtggtcggaggtcgtgtccacgaacttc 

cgggacgcctccgggccggccatgaccgagatcggcgagcagccgtgggggcgggagttcgccctgcgcgacccgg 

ccggcaactgcgtgcacttcgtggccgaggagcaggactgacacgtccgacggcggcccacgggtcccaggcctcgga 

25 gatccgtcccccttttcctttgtcgatatGatgtaattagttatgtcacgcttacattcacgccctccccccacatccgctctaacc 

gaaaaggaaggagttagacaacctgaagtctaggtccctatttattttmatagttatgttagtattaagaacgttamatattt 

aatWtcmtttttctgtacagacgcgtgtacgcatgtaacattatactgaaaaccttgcttgagaaggttttgggacgctcgaa 

ggctttaatttgcaagctggagaccaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttg 

ctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgaca 
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ggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacct 

gtccgccmctcccttcgggaagcgtggcgctttctcaatgctcacgctgtaggtatctcagttcggtgtaggtcgttcgctcc 

aagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccgg 

taagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagag 

5 ttcttgaagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttaccttcgga 

aaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcg 

cagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaaggg 

atmggtcatgcatgagatcagatcttttttgtagaaatgtcttggtgtcctcgtccaatcaggtagccatctctgaaatatctgg 

ctccgttgcaactccgaacgacctgctggcaacgtaaaattctccggggtaaaacttaaatgtggagtaatggaaccagaaa 

10 cgtctcttcccttctctctccttccaccgcccgttaccgtccctaggaaattttactctgctggagagcttcttctacggccccctt 

gcagcaatgctcttcccagcattacgttgcgggtaaaacggaggtcgtgtacccgacctagcagcccagggatggaaaag 

tcccggccgtcgctggcaataatagcgggcggacgcatgtcatgagattattggaaaccaccagaatcg'aatataaaaggc 

gaacacctttcccaattttggtttctcctgacccaaagactttaaatttaatttatttgtccctatttcaatcaattgaacaactatttc 
gaaacgaggaattcacgtggcccagccggccgtctcggatcggtacc 

15 

SEQ ID NO: 19 pGAPADElglsII 

tcgaccggctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctcac 
tgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatca 
ggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggc 

20 gtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggact 
ataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccg 
cctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagct 
gggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaaga 
cacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttg - 

25 aagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaag 
agttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaa 
aaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttgg 
tcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagtaaa 
cnggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactcc 

64 



WO 02/00856 PCT/IB01/01514 



ccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctcac 
cggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctcca 
tccagtctattaattgttgccgggaagctagagtaagtagttcgccagttaatagWgcgcaacgttgttgccattgcta 
catcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgatcccccat 
5 gttgtgcaaaaaagcggttagctccttcggtcctccgatc^ 

gcagcactgcataattctcttactgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattctgaga 
atagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtg 
ctcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatgtaacccactcgt 
gcacccaactgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaaa 
10 gggaataagggcgacacggaaatgttgaatactcatactcttcctttttcaatagctccaaggcaacaaattgactactcagac 
cgacattcattcgttattgattttaaatcaacgataaacggaatggttacttgaatgatttcactttatgatcattgmactaatt 
taaataggattttatatggaattggaagaataagggaaatttcagatgtctgaaaaaggcgaggagggtactaatcattcaag 
cccatttcttgccagtaattgcttcataagcttcaatatactttte 

tgccattcaatccgttggccgtcaaccaatctctgagaaactgcttatcgtaactctcttgcgatttacccacttggtaagtctttt 
15 gattccaaaatctagaagaatctggagttaaaacttcatctactagtaccaattcattgttttcgtccagtccaaattcgaatttcgt 
atcagcaataatgatccccttcaaaagggcgaagttttttgcagcagaatacaactcgaccgccttgacagcgaccttctcac 
aaatgtcWacctacaatctcagcagcttgttcaatagagatgttttcatcgtgttcaccctgttcagctttcgttgaaggtgtgaa 
aatcggagttggaaaggcgtcgctctcttgaaggttctcgtmcaac^^ 

catgcacttccagtgatgtaacctctgacaatggcttccaaaggtatcagtctgtgctttmactatcaaggatcgtccctctaat 

20 tgagatttgtatttttcttcagacagttttgatggtagtaaag^ 

gggtgccaaaaaatcaaaccagaaaactgagagctgagtcaaaatctttcccttatcaggaataccgtttgtcataatcacatc 
gtaagcggagatacggtcagttgcgacgaacagcaagttgttctcatcgactgcataaatgtctctaacctttcctttggcgatt 
aaaggtaggattccgtccagatcagtgttcacaatggacatacttggaaggatacagcaaagtgtgttggaagcgatgacac 
atggaaaggaatttttcgagtttcctagagtagtatattggggcggtgaaagttcagatgtttaatgcttaatactcttatactcttc 

25 aaagcgcccaagtgtttctgccaacctgacttmtctgaataatgaatcgttcaagtggagtatttaaaccatgattaagttacgt 
gatttggcactggataaggtcgaaaaatatccgtattcataaacgattattggtaaaagttacaaaataccactaattacggag 
aagcttagtaacagttatcatctcttggtcgattaacgcttacaatttccattcgccattcaggctgcgcaactgttgggaaggg 
cgatcggtgcgggcctcttcgctattacgccagggcctcgaggcacaaacgaacgtctcacttaatcttctgtactctgaaga 

ggagtgggaaataccaagaaaaacatcaaactcgaatgattttcccaaacccctaccacaagatattcatcagctgcgagat 
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aggctgatcaggagcaagctcgtacgagaagaaacaaaatgacaaaaaaaatcctatactatataggttacaaataaaaaa 
gtatcaaaaatgaagcctgcatctctcaggcaaatggcattctgacatcctcttgattagaatctagcaagaccggtcttctcgt 
aagtgcccaacttgaactgaggaacagtcatgtctaaggctaaaactcaatgatgatgatgatgatggtcgacggcgctattc 
agatcctcttctgagatgagttWgttcgggcccaaaaataacttcccaatcttcagttatgtcaagcgataggtta 
5 gatcttattatcattctcatatgacgatctagtcggcaatgaactttcttcgccattttttttgactttaatactgtctttcaacgttatgt 
tgtgttgtaaattattcWgcgataatgatttmcaatattggtattcctcaaagtattgtggtgaatgcctgtcaaattct^ 
tatgacttcgaacatt^aaggtattgtmcgaaaatgaactgagtttcta^ 

tcatcaacataaagatctccaacggctcgtccctcggtatcaggggctataactattaeatatggatcgtttttcattaacattga 
agatcttctatacttatctttcatagtgataatgtgaccgccttcaataaataatggaattttatccaatggtgcagaaatattctttt 
1 0 ctatcaaatcagtaccattgtttataaaagagtgtaaagatgcgaattcatagaatataccgggtgggaaaaccatttccgtttct 
gattgaccaggctccgtgacaggtttgactaatagacctgaattactccagtaaaattggttatcgatatgatacaattcagcaa 
attcagggtgttcaataaacattggattcattatcggaaatccagtgacacttgatttatgaaacatggtgtataaggtaggtag 
caggaaatatctcaattggataatatcacgtactatcgacttcaaaggttcattgaataagtatggttctcttctcttggtgtctata 
tgggcgtgtgctctaaaaaatgggtaccataagcccgcttggtaccaacgtgcaatcaattcaggtgtaggatcctcagcaa 

1 5 agccagctatgtcggctcctataaatggcataccagcaatgttgtttgacagaaccataggaatggaaatctttaagtaatccc 
aattggccacattgtcaccagtccatgtggcagcagtacgttgagagccggcaaaaaaagcccttgttagaaggaaaggac 
gcttatcggatggtgaataaatcgattttattgcgtcgtaagtagcttcatgcactgatagaccatatatgttatggacggatcttt 
cctcaatgtaattgtcgtgaatcaaatcttttggagctgtggtctctgggccatcgaaaatcgaaggctcgttcatatcattccaa 
atgaataaattagttaaatcagccggcagatccataaaccgttcgaaaaaggacttccaaatcttttggccatatttgcttatggt 

20 atcaatccatatagaattacctggccagcaatgacctacatagtcatttccattgtgatccttgactgctacattttcattaattacc 
ctgtcactgatttcataatctttctttaaatgaggatcgattagtacgacaagatttctacccaacttttttaatttggataacagcct 
ttttggattgggaaaggagtgctgcttccaagtaaaatattttttgtcgttcgtatactccaagtccaaccaaataaaatcgtaag 
gaatcatatgagcatccatctgagagtccactgtgagaacgtccatctcatcattataattccatctacattgatggtaccctata 
gaggaaatgggcggtaaaaagggtctaccagtcaaatcggtaaatttgtcaatgatagttggaatatctggccccagggaca 

25 tgactacatctatgacaccatmcggagatccaatgagtcatcgtmattmactggtg^catactttatgtctacccaagtgtca 

gctgcattgacccaaaagatagatgtggacgaagatgaaaacatgaatgggatcgaaccgtacattggttggctggtaccg 

atgttgtactcaaagacatcaacgttgaaaagcctgtagggttcctttccacctgaagtgtccatcagccttagcgacgtcgca. 

tgttccggtataccgtagacattagtagaaceeatgaaagagaaatctagcgcaaccgattcaggccccaaaggcatagag 

tcatgctttgaatacaagaaattgtccttaaacatgttgaaagttgtttcttctggcagcacgtgtgcgaagttttcctgcttagttc 
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tatgatgttcaatgttcaggaaatmgctcgtttacaa 

aaagamctacggatacatcaccgtttcgaagatgaaatgtgtctgcagtggagtttgaaagtgacaaaaatgaagatattttc 

gaccagaatgagttcacagtttgttmgcttaaggaagtgg 

cgaatgcgtacttccaggtctcattgaacc^ 

5 atagtgaaccttactgagtgatcctgtaaaaaagagagagagaatgggaactgaacggctatatcatcgccctccaatcttg 
gtatagttttaattatggtagcatgaagcacattctctaaaggatcgtgtgcaatagactcggcgtccactttgtaatagcagtg 
atgagatttggcaatatmctgcataaaccctgM^ 

acgcatgcgaaaaagcggtaaagaagaccaattggcatacgagccatttcaaaaggaccatctcgaggtaccgatccgag 
acggccggctgggccacgtgaattcctcgtttcgaaatagttgttcaattgattgaaatagggacaaataaattaaatttaaagt 
10 ctttgggtcaggagaaaccaaaattgggaaaggtgttcgcctmatatt^^ 

tccgcccgctattattgccagcgacggccgggacttttccatccctgggctgctaggtcgggtacacgacctccgttttaccc 
gcaacgtaatgctgggaagagcattgctgcaagggggccgtagaagaagctctccagcagagtaaaatttcctagggacg 
gtaacgggcggtggaaggagagagaagggaagagacgtttctggttccattactccacaWaagttttaccccggagaattt 
tacgttgccagcaggtcgttcggagttgcaacggagccagatatttcagagatggctacctgattggacgaggacaccaag 
15 acatttctacaaaaaagatctgatctca 

SEQ ID NO: 20 pPICZAGLSII 

cgaacaaaaactcatctcagaagaggatctgaatagcgccgtcgaccatcatcatcatcatcattgagtttgtagccttagac 
atgactgttcctcagttcaagttgggcacttacgagaagaccggtcttgctagattctaatcaagaggatgtcagaatgccattt 
20 gcctgagagatgcaggcttcattffigatactmttamgtaacctatatagtataggatttmttgtcattttgttt 
gcttgctcctgatcagcctatctcgcagctgatgaatatctt^ 

gtatttcccactcctcttcagagtacagaagattaagtgagaccttcgtttgtgcggatcccccacacaccatagcttcaaaatg 
tttctactccttttttactcttccagattttctcggactccgcgcatcgccgtaccacttcaaaacacccaagcacagcatactaa 
attttccctctttcttcctctagggtgtc^ 
25 ttcgtcgaaaaaggcaataaaaatttttatcacgmctt^ 

ttaagttaataaacggtcttcaatttctcaagtttcagtttca^^ 

gcatagcaatctaatctaaggggcggtgttgacaattaatcatcggcatagtatatcggcatagtataatacgacaaggtgag 

gaactaaaccatggccaagttgaccagtgccgttccggtgctcaccgcgcgcgacgtcgccggagcggtcgagttctgga 
ccgaccggctcgggftctcccgggacttcgtggaggacgac^^^ 
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gcgcggtccaggaccaggtggtgccggacaacaccctggcctgggtgtgggtgcgcggcctggacgagctgtacgccg 

agtggtcggaggtcgtgtccacgaacttccgggacgcctccgggccggccatgaccgagatcggcgagcagccgtggg 

ggcgggagttcgccctgcgcgacccggccggcaactgcgtgcacttcgtggccgaggagcaggactgacacgtccgac 

ggcggcccacgggtcccaggcctcggagatccgtcccccttttcctttgtcgatatcatgtaattagttatgtcacgcttacatt 

cacgccctccccccacatccgctctaaccgaaaaggaaggagttagacaacctgaagtctaggtccctatttatttttttatagt 

tatgttagtattaagaacgttaWatamcaaatttttcttttttttctgtacagacgcgtgtacgcatgtaa^ 

ttgcttgagaaggttttgggacgctcgaaggctttaatttgcaagctggagaccaacatgtgagcaaaaggccagcaaaag 

gccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgc 

tcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcct 

gttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtggcgctttctcaatgctcacgctgtagg 

tatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatc 

cggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcag 

agcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaaggacagtatttggtatct 

gcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtg 

1 5 gtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgc 

tcagtggaacgaaaactcacgttaagggattttggtcatgagatcagatctaacatccaaagacgaaaggttgaatgaaacct 

ttttgccatccgacatccacaggtccattctcacacataagtgccaaacgcaacaggaggggatacactagcagcagaccg 

ttgcaaacgcaggacctccactcctcttctcctcaacacccacttttgccatcgaaaaaccagcccagttattgggcttgattg 

gagctcgctcattccaattccttctattaggctactaacaccatgactttattagcctgtctatcctggcccccctggcgaggttc 

20 atgtttgtttatttccgaatgcaacaagctccgcattacacccgaacatcactccagatgagggctttctgagtgtggggtcaa 

atagtttcatgttccccaaatggcccaaaactgacagtttaaacgctgtcttggaacctaatatgacaaaagcgtgatctcatcc 

aagatgaactaagtttggttcgttgaaatgctaacggccagttggtcaaaaagaaacttccaaaagtcggcataccgtttgtct 

tgtttggtattgattgacgaatgctcaaaaataatctcattaatgcttagcgcagtctctctatcgcttctgaaccccggtgcacct 

gtgccgaaacgcaaatggggaaacacccgctttttggatgattatgcattgtctccacattgtatgcttccaagattctggtgg 

gaatactgctgatagcctaacgttcatgatcaaaatttaactgttctaacccctacttgacagcaatatataaacagaaggaag 

ctgccctgtcttaaaccttttttmatcatcattattagcttactttcataattgcgactggttccaattgacaagcttttgattttaacg 

acttttaacgacaacttgagaagatcaaaaaacaactaattattcgaaacgaggaattcacgtggcccagccggccgtctcg 

gatcggtacclcgagatggtccttttgaaatggctcgtatgccaattggtcttctttaccgctttttcgcatgcgtttaccgactat 

ctattaaagaagtgtgcgcaatctgggttttgccatagaaacagggtttatgcagaaaatattgccaaatctcatcactgctatt 
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acaaagtggacgccgagtctattgcacacgatcctttagagaatgtgcttcatgctaccataattaaaactataccaagattgg 
agggcgatgatatagccgttcagttcccattctctctctctttmacaggatcactcagtaaggttcactataaat 
agaatgccaaccaacagcagcggmg^gatctcttcacaacggttcaatgagacctggaagtacgcattcgacaagaaatt 
tcaagaggaggcgaacaggaccagtattccacaattccacttccttaagcaaaaacaaactgtgaactcattctggtcgaaa 
5 atatcttcatttttgtcactttcaaactccactgcagacacatttcatcttcgaaacggtgatgtatccgtagaaatctttgctgaac 
ctmcaattgaaagtttactggcaaaatgcgctgaaacttattgtaaacgagcaaaatttcctgaacattgaacatcatagaact 
aagcaggaaaacttcgcacacgtgctgccagaagaaacaactttcaacatgtttaaggacaatttcttgtattcaaagcatga 
ctctatgcctttggggcctgaatcggttgcgctagatttctctttcatgggttctactaatgtctacggtataccggaacatgcga 
cgtcgctaaggctgatggacacttcaggtggaaaggaaccctacaggcttttcaacgttgatgtctttgagtacaacatcggt 

10 accagtcaaccaatgtacggttcgatcccattcatgttttcatcttcgtccacatctatcttttgggtcaatgcagct^ 

ggtagacataaagtatgacaccagtaaaaataaaacgatgactcattggatctccgaaaatggtgtcatagatgtagtcatgt 
ccctggggccagatattccaactatcattgacaaatttaccgatttgactggtagaccctttttaccgcccatttcctctataggg 
taccatcaatgtagatggaattataatgatgagatggacgttctcacagtggactctcagatggatgctcafatgattccttacg 
attttatttggttggacttggagtatacgaacgacaaaaaatattttacttggaagcagcactcctttcccaatccaaaaaggctg 

1 5 ttatccaaattaaaaaagttgggtagaaatcttgtcgtactaatcgatcctcatttaaagaaagattatgaaatcagtgacagggt 
aattaatgaaaatgtagcagtcaaggatcacaatggaaatgactatgtaggtcattgctggccaggtaattctatatggattgat 
accataagcaaatatggccaaaagatttggaagtcctttttcgaacggtttatggatctgccggctgatttaactaatttattcatt 
tggaatgatatgaacgagccttcgattttcgatggcccagagaccacagctccaaaagatttgattcacgacaattacattga 
ggaaagatccgtccataacatatatggtctatcagtgcatgaagctacttacgacgcaataaaatcgatttattcaccatccgat 

20 aagcgtcctttccttctaacaagggctttttttgccggctctcaacgtactgctgccacatggactggtgacaatgtggccaatt 
gggattacttaaagatttccattcctatggttctgtcaaacaacattgctggtatgccatttataggagccgacatagctggcttt 
gctgaggatcctacacctgaattgattgcacgttggtaccaagcgggcttatggtacccattttttagagcacacgcccatata 
gacaccaagagaagagaaccatacttattcaatgaacctttgaagtcgatagtacgtgatattatccaattgagatatttcctgc 
tacctaccttatacaccatgmcataaatcaagtgtcactggamccgataatgaatccaatgtttattgaacaccctgaatttgc 

25 tgaattgtatcatatcgataaccaatmactggagtaattcaggtctattagtcaaacctgtcacggagcctggtcaatcagaaa 
cggaaatggttttcccacccggtatattctatgaattcgcatcWacactcttttataaacaatggtactgatttgatagaaaaga 
atatttctgcaccattggataaaattccattatttattgaaggcggtcacattatcactatgaaagataagtatagaagatcttcaa 
tgttaatgaaaaacgatccatatgtaatagttatagcccctgataccgagggacgagccgttggagatctttatgttgatgatg 
gagaaacttttggctaccaaagaggtgagtacgtagaaactcagttcattttcgaaaacaataccttaaaaaatgttcgaagtc 
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atattcccgagaatttgacaggcattcaccacaatactttgaggaataccaatattgaaaaaatcattatcgcaaagaataattt 
acaacacaacataacgttgaaagacagtattaaagtcaaaaaaaatggcgaagaaagttcattgccgactagatcgtcatat 
gagaatgataataagatcaccattcttaacctatcgcttgacataactgaagattgggaagttatttttgggcc 

SEQID NO: 21 pPICADElglsII 
aaattcctcgtttcgaataattagttgtttmga 

cagtcgcaattatgaaagtaagctaataatgatgataaaaaaaaaggtttaagacagggcagcttccttctgtttatatattgct 

gtcaagtaggggttagaacagttaaattttgatcatgaacgttaggctatcagcagtattcccaccagaatcttggaagcatac 

aatgtggagacaatgcataatcatccaaaaagcgggtgtttccccatttgcgtttcggcacaggtgcaccggggttcagaag 

cgatagagagactgcgctaagcattaatgagattatttttgagcattcgtcaatcaataccaaacaagacaaacggtatgccg 

actmggaagtttcttmgaccaactggccgttagcamcaacgaaccaaacttagttcatcttggatgagatcacgcTO 

atattaggttccaagacagcgtttaaactgtcagttttgggccatttggggaacatgaaactatttgaccccacactcagaaag 

ccctcatctggagtgatgttcgggtgtaatgcggagcttgttgcattcggaaataaacaaacatgaacctcgccaggggggc 

caggatagacaggctaataaagtcatggtgttagtagcctaatagaaggaattggaatgagcgagctccaatcaagcccaat 
aactgggctggttmcgatggcaaaagtgggtgttgaggagaaga 

agtgtatcccctcctgttgcgtttggcacttatgtgtgagaatggacGtgtggatgtcggatggcaaaaaggtttcattcaa 

ttcgtctttggatgttgtcgaccggctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctctt 

ccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacg 

gttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaa 

ggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcg 

aaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgctt 

accggatacctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtag 

gtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttga 

gtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcg 

gtgctacagagttcttgaagtggtggcctaactacggctacactagaaggacagtamggtatct^^ 

gttaccttcggaaaaagagttggtagctcttgatccggcaaa 

cagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactc 

acgttaagggatmggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatcta 

aagtatatatgagtaaacttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatc 
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catagttgcctgactccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccg 
cgagacccacgctcaccggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctg 
caacmatccgcctccatccag^ctattaattgttgccgggaagctagagtaagtagttcgccagttaatagmgcgcaac 
gttgccattgctacaggcatcgtggtgtcacgct^^ 
5 gttacatgatcCcccatgttgtgcaaaaaagcggttagrt^ 
ttatcactcatggttatggcagcaetgcataattctctt^^ 

accaagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacata 
gcagaactttaaaagtgctcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagtt 
cgatgtaacccactcgtgcacccaactgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggc 

10 aaaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcatactcttcctttttcaatagctccaaggcaac 
aaattgactactcagaccgacattcattcgttattgattttaaatcaacgataaacggaatggttacttgaatgatttcactttatga 
tcattgtttactaattacctaaataggattttatatggaattggaagaataagggaaatttcagatgtctgaaaaaggcgaggag 
ggtactaatcattcaagcccamcttgccagtaattgcttcataagcttcaatatacttttctttactcttgatagcaatttctgcatc 
catggctacgccctctttgccattcaatccgttggccgtcaaccaatctctgagaaactgcttatcgtaactctcttgcgatttac 

1 5 ccacttggtaagtcmtgattccaaaatctagaagaatctggagttaaaacttcatctactagtaccaattcattgtttt 

tccaaattcgaatttcgtatcagcaataatgatccccttcaaaagggcgaagttttttgcagcagaatacaactcgaccgccttg 
acagcgaccttctcacaaatgtcmacctacaatctcagcagcttgttcaatagagatgttttcatcgtgttcaccctgttcagct 
ttcgttgaaggtgtgaaaatcggagttggaaaggcgtcgctctcttgaaggttctcgtmcaaccttgactccatggacagtttt 
tgagttcttgtactctttccatgcacttccagtgatgtaacctgtgacaatggcttccaaaggtatcagtctgtgctttttt 

20 aggatcgtccctctaattgagatttgtatttttcttcagacagttttgatggtagtaaagcaaagacttccttgtcattagaagca 
ccaaatgattctttatgtagggtgccaaaaaatcaaaccagaaaactgagagctgagtcaaaatctttcccttatcaggaatac 
cgtttgtcataatcacatcgtaagcggagatacggtcagttgcgacgaacagcaagttgttctcatcgactgcataaatgtctc 
taacctttcctttggcgattaaaggtaggattccgtccagatcagtgttcacaatggacatacttggaaggatacagcaaagtg 
tgttggaagcgatgacacatggaaaggaattmcgag^cctagagtagtatattggggcggtgaaagttcagatgm 

25 cttaatactcttatactcttcaaagcgcccaagtgtttctgccaacctgacttttltctgaa^ 

aaccatgattaagttacgtgatttggcactggataaggtcgaaaaatatccgtattcataaacgattattggtaaaagttacaaa 
ataccactaattacggagaagcttagtaacagttatcatctcttggtcgattaacgcttacaatttccattcgccattcaggctgc 
gcaactgttgggaagggcgatcggtgcgggcctcttcgctattacgccagggcctcgaggcacaaacgaacgtctcactta 
atcttctgtactctgaagaggagtgggaaataccaagaaaaacatcaaactcgaatgattttcccaaacccctaccacaagat 
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attcatcagctgcgagataggctgatcaggagcaagctcgtacgagaagaaacaaaatgacaaaaaaaatcctatactatat 
aggttacaaataaaaaagtatcaaaaatgaagcctgcatctctcaggcaaatggcattctgacatcctcttgattagaatctag 
caagaccggtcttctcgtaagtgcccaacttgaactgaggaacagtcatgtctaaggctacaaactcaatgatgatgatgatg 
atggtcgacggcgctattcagatcctcttctgagatgagtttttgttcgggcccaaaaataacttcccaatcttcagttatgtcaa 
5 gcgataggttaagaatggtgatcttattatcattctcatatgacgatctagtcggcaatgaactttcttcgccattttttttgacttta 
atactgtctttcaacgttatgttgtgttgtaaattattctttgcgataatgattttttcaatattggtattcctcaaagtattgtggtgaat 
gcctgtcaaattctcgggaatatgacttcgaacattttttaaggtattgttttcgaaaatgaactgagtttctacgtactcacctcttt 
ggtagccaaaagtttctccatcatcaacataaagatctccaacggctcgtccctcggtatcaggggctataactattacatatg 
gatcgtttttcattaacattgaagatcttctatacttatctttcatagtgataatgtgaccgccttcaataaataatggaattttatcca 
10 atggtgcagaaatattcttttctatcaaatcagtaccattgtttataaaagagtgtaaagatgcgaattcatagaatataccgggt 
gggaaaaccatttccgtttctgattgaccaggctccgtgacaggtttgactaatagacctgaattactccagtaaaattggttat 
cgatatgatacaattcagcaaattcagggtgttcaataaacattggattcattatcggaaatccagtgacacttgatttatgaaac 
atggtgtataaggtaggtagcaggaaatatctcaattggataatatcacgtactatcgacttcaaaggttcattgaataagtatg 
gttctcttctcttggtgtctatatgggcgtgtgctctaaaaaatgggtaccataagcccgcttggtaccaacgtgcaatcaattca 
1 5 ggtgtaggatcctcagcaaagccagctatgtcggctcctataaatggcataccagcaatgttgtttgacagaaccataggaat 
ggaaatctttaagtaatcccaattggccacattgtcaccagtccatgtggcagcagtacgttgagagccggcaaaaaaagcc 
cttgttagaaggaaaggacgcttatcggatggtgaataaatcgattttattgcgtcgtaagtagcttcatgcactgatagaccat 
atatgttatggacggatctttcctcaatgtaattgtcgtgaatcaaatcttttggagctgtggtctctgggccatcgaaaatcgaa 
ggctcgttcatatcattccaaatgaataaattagttaaatcagccggcagatccataaaccgttcgaaaaaggacttccaaatc 
20 ttttggccatatttgcttatggtatcaatccatatagaattacctggccagcaatgacctacatagtcatttccattgtgatccttga 
ctgctacattttcattaattaccctgtcactgatttcataatctttctttaaatgaggatcgattagtacgacaagatttctacccaac 
ttttttaatttggataacagccttWggattgggaaaggagtgctgcttccaagtaaaatattttttgtcgttcgtatactccaagtc 
caaccaaataaaatcgtaaggaatcatatgagcatccatctgagagtccactgtgagaacgtccatctcatcattataattccat 
ctacattgatggtaccctatagaggaaatgggcggtaaaaagggtctaccagtcaaatcggtaaatttgtcaatgatagttgg 
25 aatatctggccccagggacatgactacatctatgacaccattttcggagatccaatgagtcatcgttttatttttactggtgtcata 
ctttatgtctacccaagtgtcagctgcattgacccaaaagatagatgtggacgaagatgaaaacatgaatgggatcgaaccg 
tacattggttggctggtaccgatgttgtactcaaagacatcaacgttgaaaagcctgtagggttcctttccacctgaagtgtcca 
tcagccttagcgacgtcgcatgttccggtataccgtagacattagtagaacccatgaaagagaaatctagcgcaaccgattc 
aggccccaaaggcatagagtcatgctttgaatacaagaaattgtccttaaacatgttgaaagttgtttcttctggcagcacgtgt 
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gcgaagtmcctgcttagttctatgatgttcaatgttcaggaaatmgctcgWacaataagtttcagcgcattttgccagtaaa 
cmcaattgaaaaggttcagcaaagamctacggatacatcaccgmcgaagatgaaatgtgtctgcagtggagtttgaaag 
tgacaaaaatgaagatatmcgaccagaatgagttcaca gU lg LU LLgcttaaggaagtggaattgtggaatactggtcct^ 
tcgcctcctcttgaaatttcttgtcgaatgcgtacttccaggtctcattgaaccgttgtgaagagatcaacaaacc 
5 gttggcattctctcmctcatttatagtgaaccttactgagtgatcctgtaaaaaagagagagagaatgggaactgaacggcta 
tatcatcgccctccaatcttggtatagtmaattatggte^ 

tccacmgtaatagcagtgatgagatttggcaatattttctgcataaaccctgtttctatggcaaaacccagattgcgcacactt 

ctttaatagatagtcggtaaacgcatgcgaaaaagcggtaaagaagaccaattggcatacgagccatttcaaaaggaccatc 
teg 

10 

SEQ ID NO: 22 pYPTlZAGLSII 

cgaacaaaaactcatctcagaagaggatctgaatagcgccgtcgaccatcatcatcatcatcattgagtttgtagccttagac 
atgactgttcctcagttcaagttgggcacttacgagaagaccggtcttgctagattctaatcaagaggatgtcagaatgccattt 
gcctgagagatgcaggcttcattmgatacttttttam 
15 gcttgctcctgatcagcctatctcgcagctgatgaatatc^ 

gtamcccactcctcttcagagtacagaagattaagtgagaccttcgtttgtgcggatcccccacacaccatagcttcaaaatg 
tttctactccttttttactcttccagattttctcg 

amtccctcmcttcctctagggtgtcgttaanacccgtactaaaggtttggaaaagaaaaaagagaccgcctcgmcttW 
ttcgtcgaaaaaggcaataaaaatttttatcacgtttcttmcttgaaamtmtmagttttmctctttc 

20 ttaagttaataaacggtcttcaaWctcaagmcagmcatltttcttgttctattacaactmmacttcttgttc 

gcatagcaatctaatctaaggggcggtgttgacaattaatcatcggcatagtatatcggcatagtataatacgacaaggtgag 
gaactaaaccatggccaagttgaccagtgccgttccggtgctcaccgcgcgcgacgtcgccggagcggtcgagttctgga 
ccgaccggctcgggttctcccgggacttcgtggaggacgacttcgccggtgtggtccgggacgacgtgaccctgttcatca 
gcgcggtccaggaccaggtggtgccggacaacaccctggcctgggtgtgggtgcgcggcctggacgagctgtacgccg 

25 agtggtcggaggtGgtgtccacgaacttccgggacgcctccgggccggccatgaccgagatcggcgagcagccgtggg 
ggcgggagttcgccctgcgcgacccggccggcaactgcgtgcacttcgtggccgaggagcaggactgacacgtccgac 
ggcggcccacgggtcccaggcctcggagatccgtcccccttttcctttgtcgatatcatgtaattagttatgtcacgcttacatt 
cacgccctccccccacatccgctctaaccgaaaaggaaggagttagacaacctgaagtctaggtccctatttatttttttata 
tatgttagtattaagaacgttatttatamcaaattmctttttmrt 
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ttgcttgagaaggttttgggacgctcgaaggcmaatttgcaagctggagaccaacatgtgagcaaaaggccagcaaaag 

gccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgc 

tcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcct 
gttccgaccctgccgcttaccggatacctgt^ 

5 tatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatc 
cggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcag 
agcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaaggacagtatttggtatct 
gcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctg^agcggtg 
gtmwgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatccmgatcttttctacggggtctgacgc 
10 tcagtggaacgaaaactcacgttaagggatmggtcatgagatcagatctatgatgagtcacaatctgcttccacagacga^ 

acaaggacaggcaaaaggaattggaagaagttgctaacccaataatgagcaagttctatggagctgctggtggagctcctg 
gtggagctcctggtggcttccctggaggmccctg^ 

ggcggagacagcggaccaaccgtggaagaagtcgattaagcaattcaacggataaattctggttaatatatataacgtgaat 

aggaaattaaggaaattttggatctaataatgtgctgtatgccgacatcgggcatcgtagattgtatagtatcgctgacactata 

15 ataagccagccaaaacccctaaaccagttgccctccactaattagtgtactacccaatcttgcctcttcgggtgtctmataag 

gacagattcacaagctcttgttgcccaatacacacatacacacagagataatagcagtcgaattcacgtggcccagccggc 
cgtctcggatcggtacctcgagatggtcct^ 

cgactatctattaaagaagtgtgcgcaatctgggttttgccatagaaacagggtttatgcagaaaatattgccaaatctcatcac 
tgctattacaaagtggacgccgagtctattgcacacgatcctttagagaatgtgcttcatgctaccataattaaaactataccaa 

20 gattggagggcgatgatatagccgttcagttcccattctctctctcttttttacaggatcactcagtaaggttcactataaatgag 
aaagagagaatgccaaccaacagcagcggtttgttgatctcttcacaacggttcaatgagacctggaagtacgcattcgaca 
agaaatttcaagaggaggcgaacaggaccagtattccacaattccacttccttaagcaaaaacaaactgtgaactcattctgg 
tcgaaaatatcttcattmgtcacmcaaactccactgcagacacatttcatcttcgaaacggtgatgtatccgtagaaatcm 
ctgaaccttttcaattgaaag^actggGaaaatgcgctgaaacttattgtaaacgagcaaaamcctgaacattgaacatcat 

25 agaactaagcaggaaaacttcgcacacgtgctgccagaagaaacaactttcaacatgtttaaggacaatttcttgtattcaaa 

gcatgactctatgccmggggcctgaatcggttgcgctagamctcWcatgggttctactaatgtctacggtataccggaac 

atgcgacgtcgctaaggctgatggacacttcaggtggaaaggaaccctacaggcttttcaacgttgatgtctttgagtacaac 

atcggtaccagccaaccaatgtacggttcgatcccattcatgttttcatcttcgtccacatctatcttttgggtcaatgcagctga 

cacttgggtagacataaagtatgacaccagtaaaaataaaacgatgactcattggatctccgaaaatggtgtcatagatgtag 
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tcatgtccctggggccagatattccaactatcattgacaaamaccgamgactggtagaccctttttaccgcccatttcctcta 
tagggtaccatcaatgtagatggaattataatgatgagatggacgttctcacagtggactctcagatggatgctcatatgattcc 
ttacgattttatttggttggacttggagtatacgaacgacaaaaaatatmacttggaagcagcactcctttcccaatccaaaaa 
ggctgttatccaaattaaaaaagttgggtagaaatcttgtcgtactaatcgatcctcatttaaagaaagattatgaaatcagtgac 
5 agggtaattaatgaaaatgtagcagtcaaggatcacaatggaaatgactatgtaggtcattgctggccaggtaattctatatg^ 
attgataccataagcaaatatggccaaaagatttggaagtccttmcga 

ttcaWggaatgatatgaacgagccttcgattttcgatggcccagagaccacagctccaaaagatttgattcacgacaattac 
attgaggaaagatccgtccataacatatatggtctatcagtgcatgaagctacttacgacgcaataaaatcgatttattcaccat 
ccgataagcgtcctttccttctaacaagggctttttttgccgg 

10 caattgggattacttaaagatttccattcctatggttctgtcaaacaacattgctggtatgccatttataggagccgacatagctg 
gctttgctgaggatcctacacctgaattgattgcacgttggxaccaagcgggcttatggtacccattttttagagcacacgccc 
atatagacaccaagagaagagaaccatacttattcaatgaacctttgaagtcgatagtacgtgatattatccaattgagatattt 
cctgctacctaccttatacaccatgtttcataaatcaagtgtcactggatttccgataatgaatccaatgtttattgaacaccctga 
atttgctgaattgtatcatatcgataaccaattttactggagtaattcaggtctattagtcaaacctgtcacggagcctggtcaatc 

15 agaaacggaaatggttttcccacccggtatattetatgaattcgcatctttacactcttttataaacaatggtactgattt^ 

aaagaatatttctgcaccattggataaaattccattatttattgaaggcggtcacattatcactatgaaagataagtatagaagat 
cttcaatgttaatgaaaaacgatccatatgtaatagttatagcccctgataccgagggacgagccgttggagatctttatgttga 
tgatggagaaactWggctaccaaagaggtgagtacgtagaaactcagttcatfttcgaaaacaataccttaaaaaatgttcg 
aagtcatattcccgagaatttgacaggcattcaccacaatactttgaggaataccaatattgaaaaaatcattatcgcaaagaat 

20 aatttacaacacaacataacgttgaaagacagtattaaagtcaaaaaaaatggcgaagaaagttcattgccgactagatcgtc 
atatgagaatgataataagatcaccattcttaacctatcgcttgacataactgaagattgggaagttatttttgggcc 

SEQ ID NO: 23 pYPTlADElglsII 

gtcgaccggctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctca 
25 ctgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatc 
aggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctgg 
cgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacagga 
ctataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtcc 
gcctttctcccttcgggaagcgtggcgcmctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagc 
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tgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaag 
acacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgcta^ 

gaagtggtggcctaactacggctac^ctagaaggacagtaWggtatctgcgctctgctgaagccagttacctt^ 
gagttggtagctcttgatccggcaaacaaaccaccgctg^ 

aaaaaaggatctcaagaagatcctttgatctmctacggggtctgacgctcagtggaacgaaaactcacgttaagggatt^ 
gtcatgagattatcaaaaaggatcttcacctagatcctmaaatte^^ 

acttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgc^ 
cccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctca 
ccggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctcc 
atccagtclattaattgttgccgggaagctagagtaagtagtfc^ 

gcatcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgatccccca 
tgttgtgcaaaaaagcggttagctccttcggtcctc^ 

ggcagcactgcataattctcttactgtcatgccatccgtaa 

aatagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaagt 

gGtcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatgtaacccactcg 

tgcacccaactgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaa 

agggaataagggcgacacggaaatgttgaatactcatactcttcctttttcaatagctccaaggcaacaaattgactactcaga 

ccgacattcattcgttattgattttaaatcaacgataaacggaatggttacttgaatgatttcactttatgatcattgtttactaattac 

ctaaataggattttatatggaattggaagaataagggaaatttcagatgtctgaaaaaggcgaggagggtactaatcattcaa 

gcccamcttgccagtaattgcttcataagcttcaatatacttttctttactcttgatagcaatttctgcatccatggctacgccctct 

ttgccattcaatccgttggccgtcaaccaatctctgagaaactgcttatcgtaactctcttgcgatttacccacttggtaagtctttt 

gattccaaaatctagaagaatctggagttaaaacttcatctactagtaccaattcattgttttcgtccagtccaaattcgaatttcgt 

atcagcaataatgatccccttcaaaagggcgaagttttttgcagcagaatacaactcgaccgccttgacagcgaccttctcac 

aaatgtctttacctacaatctcagcagcttgttcaatagagatgttttcatcgtgttcaccctgttcagcmcgttgaaggtgtgaa 

aatcggagttggaaaggcgtcgctctcttgaaggttctcgtmcaaccttgactccatggacagtttttgagttctt^ 

catgcacttccagtgatg^aacctctgacaatggcttccaaaggtatcagtctgtgcttttttactatcaag 

tgagatttgtatttttcttcagacagttttgatggtagtaaagcaaagacttccttgtcattagaagcaaccaaatgattctttatgta 

gggtgccaaaaaatcaaaccagaaaactgagagctgagtcaaaatctttcccttatcaggaataccgtttgtcataatcacatc 

gtaagcggagatacggtcagttgcgacgaacagcaagttgttctcatcgactgcataaatgtctctaacctttcctttggcgatt 
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aaaggtaggattccgtccagatcagtgttcacaatggacatacttggaaggatacagcaaagtgtgttggaagcgatgacac 

atggaaaggaattmcgagmcctagagtagtatattggggcggtgaaag^ 

aaagcgcccaagtgmctgccaacctgactttmctgaataatgaatc 

gamggcactggataaggtcgaaaaatatccgtattcataaacgattattggtaaaagttacaaaataccactaattacggag 
5 aagcttagtaacagttatcatctcttggtcgattaacgcttacaatttccattcgccattcaggctgcgcaactgttgggaaggg 
cgatcggtgcgggcctcttcgctattacgccagggcctcgaggcacaaacgaacgtctcacttaatcttctgtactctgaaga 
ggagtgggaaataccaagaaaaacatcaaactcgaatgattttcccaaacccctaccacaagatattcatcagctgcgagat 
a gg ct g atca gg a g c ^agctcgtacgagaagaaacaaaatgacaaaaaaaatcctatactatataggttacaaataaaaaa 
gtatcaaaaatgaagcctgcatctctcaggcaaatggcattctgacatcctcttgattagaatctagcaagaccggtcttctcgt 
10 aagtgcccaacttgaactgaggaacagtcatgtctaaggctacaaactcaatgatgatgatgatgatggtcgacggcgctatt 
cagatcctcttctgagatgag^ttttgttcgggcccaaaaataacttcccaatcttcagttatgtcaagcgataggttaagaatgg 
tgatcttattatcattctcatatgacgatctagtcgg 

ttgtgttgtaaattattcmgcgataatgatttmcaatattggtattcctcaaagtattgtggtgaatgcctgt 
atatgacttcgaacattttttaaggtattgttttcgaaaatgaactgag^ 

1 5 atcatcaacataaagatctccaacggctcgtccctcggtatcaggggctataactattacatatggatcgttmcattaacattg 
aagatcttctatacttatctttcatagtgataatgtgaccgccttcaataaataatggaattttatccaatggtgcagaaatattcttt 
tctatcaaatcagtaccattgtttataaaagagtgtaaagatgcgaattcatagaatataccgggtgggaaaaccatttccgtttc 
tgattgaccaggctccgtgacaggtttgactaatagacctgaattactccagtaaaattggttatcgatatgatacaattcagca 
aattcagggtgttcaataaacattggattcattatcggaaatccagtgacacttgatttatgaaacatggtgtataaggtaggta 

20 gcaggaaatatctcaattggataatatcacgtactatcgacttcaaaggttcattgaataagtatggttctcttctcttggtgtctat 
^gggcgtgtgct^aaaaaatgggtaccataagcccgcttggtaccaacgtgcaatcaattcaggtgtaggatcctcagca 
aagccagctatgtcggctcctataaatggcataccagcaatgttgtttgacagaaccataggaatggaaatctttaagtaatcc- 
caattggccacattgtcaccagtccatgtggcagcagtacgttgagagccggcaaaaaaagcccttgttagaaggaaagga 
cgcttatcggatggtgaataaatcgattttattgcgtcgtaagtagcttcatgcactgatagaccatatatgttatggacggatctt 

25 tcctcaatgtaattgtcgtgaatcaaatcttttggagc^ 

aatgaataaattagttaaatcagccggcagatccataaaccgttcgaaaaaggacttccaaatcttttggccatatttgcttatg 
gtatcaatccatatagaattacctggccagcaatgacctacatagtcatttccattgtgatccttgactgctacattttcattaatta 
ccctgtcactgatttcataatcmcmaaatgaggatcgattagtacgacaagatttctacccaacttttttaatttggataacagc 
cmttggattgggaaaggagtgctgcttccaagtaaaatattttttgtcgttcgtatactccaagtccaaccaaataaaatcgtaa 
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ggaatcatatgagcatccatctgagagtccactgtgagaacgtccatctcatcattataattccatctacattgatggtaccctat 
agaggaaatgggcggtaaaaagggtctaccagtcaaatcggtaaatttgtcaatgatagttggaatatctggccccagggac 
atgactacatctatgacaccattttcggagatccaatg^ 

agctgcattgacccaaaagatagatgtggacgaagatgaaaacatgaatgggatcgaaccgtacattggttggctggtacc 
5 gatgttgtactcaaagacatcaacgttgaaaagcctgtagggttcctttccacctgaagtgtccatcagccttagcgacgtcgc 
atgttccggtataccgtagacattagtagaacccatgaaagagaaatctagcgcaaccgattcaggccccaaaggcataga 
gtcatgctttgaatacaagaaattgtccttaaacatgttgaa 
ctatgatgttcaatgttcaggaaatmgctcgmacaataag^ 

caaagatttctacggatacatcaccgtttcgaagatgaaatgtgtctgcagtggagtttgaaagtgacaaaaatgaagatatttt 
. 1 0 cgaccagaatgagttcacagmgttmgcttaaggaagtggaattg 

gtcgaatgcgtacttccaggtctcattgaaccgttgtgaagagatcaacaaaccgctgctgttggttggcattctctctttctcat 
ttatagtgaaccttactgagtgatcctgtaaaaaagagagagagaatgggaactgaacggctatatcatcgccctccaatctt 
ggtatagttttaattatggtagcatgaagcacattctctaaaggatcgtgtgcaatagactcggcgtccactttgtaatagcagt 
gatgagamggcaatattttctgcataaaccctgtttctatggcaaaacccagattgcgcacacttctttaatagatagtcggta 

15 aacgcatgcgaaaaagcggtaaagaagaccaattggcatacgagccatttcaaaaggaccatctcgaggtaccgatccga 
gacggccggctgggccacgtgaattcgactgctattatctctg^gtgtatgtgtgtattgggcaacaagagcttgtgaatct 
ccttataaaagacacccgaagaggcaagattgggtagtacactaattagtggagggcaactggtttaggggttttggctggc 
ttattatagtgtcagcgatactatacaatctacgatgcccgatgtcggcatacagcacattattagatccaaaatttccttaatttc 
ctattcacgttatatatattaaccagaatttatccgttgaattgcttaatcgacttcttccacggttggtccgctgtctccgcctggg 

20 gcagcaccacctggggcaccgccagctgcgccagctccgccagggaaacctccagggaagccaccaggagctccacc 
aggagctccaccagcagctccatagaacttgctcattattgggttagcaacttcttccaattccttttgcctgtccttgtactcgtc 
tgtggaagcagattgtgactcatcatagatctgatctcat 

SEQ ID NO; 24 p G APZAgl sIIHDEL 
25 tcgagatggtcctmgaaatggctcgtatgccaattggtcttctttaccgctmcgcatgcgmaccgactatctattaaagaa 
gtgtgcgcaatctgggttttgccatagaaacagggtttatgcagaaaatattgccaaatctcatcactgctattacaaagtgga 
cgccgagtctattgcacacgatcctttagagaatgtgcttcatgctaccataatt^aaactataccaagattggagggcgatga 
tatagccgttcagttcccattctctctctcttttttacaggatcactcagtaaggttcactataaatgagaaagagagaatgccaa 
ccaacagcagcggtttgttgatctcttcacaacggttcaatgagacctggaagtacgcattcgacaagaaatttcaagagga 
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ggcgaacaggaccagtattccacaattccacttccttaagcaaaaacaaactgtgaactcattctggtcgaaaatatcttcattt 
ttgtcactttcaaactccactgcagacacatttcatcttcgaaacggtgatgtatccgtagaaatctttgctgaaccttttcaattg 
aaagtttactggcaaaatgcgctgaaacttattgtaaacgagcaaaatttcctgaacattgaacatcatagaactaagcagga 
aaacttcgcacacgtgctgccagaagaaacaactttcaacatgtttaaggacaatttcttgtattcaaagcatgactctatgcctt 
5 tggggcctgaatcggttgcgctagatttctctttcatgggttctactaatgtctacggtataccggaacatgcgacgtcgctaag 
gctgatggacacttcaggtggaaaggaaccctacaggctmcaacgttgatgtctttgagtacaacatcggtac^ 
caatgtacggttcgatcccattcatgttttcatcttcgtccacatctatcttttgggtcaatgcagctgacacttgggtagacataa 
agtatgacaccagtaaaaataaaacgatgactcattggatctccgaaaatggtgtcatagatgtagtcatgtccctggggcca 
gatattccaactatcattgacaaatttaccgamgactggtagaccctttttaccgcccatttcctctatagggtaccatcaatgta 

10 gatggaattataatgatgagatggacgttctcacagtggactctcagatggatgctcatatgattccttacgatmatttggttgg 
acttggagtatacgaacgacaaaaaatattttacttggaagcagcactcctttcccaatccaaaaaggctgttatccaaattaaa 
aaagttgggtagaaatcttgtcgtactaatcgatcctcatttaaagaaagattatgaaatcagtgacagggtaattaatgaaaat 
gtagcagtcaaggatcacaatggaaatgactatgtaggtcattgctggccaggtaattctatatggattgataccataagcaaa 
tatggccaaaagatttggaagtcctttttcgaacggtttatggatctgccggctgatttaactaatttattcatttggaatgatatga 

15 acgagccttcgattttcgatggcccagagaccacagctccaaaagatttgattcacgacaattacattgaggaaagatccgtc 
cataacatatatggtctatcagtgcatgaagctacttacgacgcaataaaatcgatttattcaccatccgataagcgtcctttcct 
tctaacaagggctttttttgccggctctcaacgtactgctgccacatggactggtgacaatgtggccaattgggattacttaaa 
atttccattcctatggttctgtcaaacaacattgctggtatgccatttataggagccgacatagctggctttgctgaggatcctac 
acctgaattgattgcacgttggtaccaagcgggcttatggtacccattttttagagcacacgcccatatagacaccaagagaa 

20 gagaaccatacttattcaatgaacctttgaagtcgatagtacgtgatattatccaattgagatatttcctgctacctaccttataca 
ccatgtttcataaatcaagtgtcactggamccgataatgaatccaatgtttattgaacaccctgaatttgctgaattgtatcatat 
cgataaccaattttactggagtaattcaggtctattagtcaaacctgtcacggagcctggtcaatcagaaacggaaatggttttc 
ccacccggtatattctatgaattcgcatcmacactcttttataaacaatggtactgatttgatagaaaagaatatttctgcaccat 
tggataaaattccattatttattgaaggcggtcacattatcactatgaaagataagtatagaagatcttcaatgttaatgaaaaae 

25 gatccatatgtaatagttatagcccctgataccgagggacgagccgttggagatctttatgttgatgatggagaaacttttggct 
accaaagaggtgagtacgtagaaactcagttcattttcgaaaacaataccttaaaaaatgttcgaagtcatattcccgagaattt 
gacaggcattcaccacaatactttgaggaataccaatattgaaaaaatcattatcgcaaagaataatttacaacacaacataac 
gttgaaagacagtattaaagtcaaaaaaaatggcgaagaaagttcattgccgactagatcgtcatatgagaatgataataaga 
tcaccattcttaacctatcgcttgacataactgaagattgggaagttaitttttgggcccgaacaaaaactcatctcagaagagg 
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atctgaatagcgccgtcgaccacgacgaactgtgagm 
agaagaccggtcttgctagattctaatcaagaggatgtcagaatgccamg 
ttamgtaacctatatag^ataggatttttttt^ 
gaatatcttgtggtaggggtttgggaaaatcattcgagmgat^^ 
5 aagtgaga(xttcgtttgtgcggatcccccacacaccatagcttcaaaatgmctactccttttttactcttc^ 

ctccgcgcatcgccgtaccacttcaaaacacccaagcacagcatactaaatmccctcmcttcctctagggtgtcgttaatta 
cccgtactaaaggtttggaaaagaaaaaagagaccgcctcgmcttmcttcgtcgaaaaaggcaataaaaattm 
mctlTOcttgaaalULLtUtUagtttttttctctttcagtgacctccattgataWaagtt 
agmcatttttcttgttctattacaactttttto^ 

10 aattaatcatcggcatagtatatcggcatagtataatacgacaaggtgaggaactaaaccatggccaagttgaccagtgccgt 
tccggtgctcaccgcgcgcgacgtcgccggagcggtcgagttctggaccgaccggctcgggttctcccgggacttcgtgg 
aggacgacttcgccggtgtggtccgggacgacgtgaccctgttcatcagcgcggtccaggaccaggtggtgccggacaa 
caccctggcctgggtgtgggtgcgcggcctggacgagctgtacgccgagtggtcggaggtcgtgtccacgaacttccgg 
gacgcctccgggccggccatgaccgagatcggcgagcagccgtgggggcgggagttcgccctgcgcgaccc^gccg 

15 gcaactgcgtgcacttcgtggccgaggagcaggactgacacgtccgacggcggcccacgggtcccaggcctcggagat 
ccgtcccccttttccmgtcgatatcatgtaattagttatgtcacgcttacattcacgccctccccccacatccgctctaaccgaa 
aaggaaggagttagacaacctgaagtctaggtccctatttatttttttatagttatgttagtattaagaacgttatttatatttcaaatt 
mcllUlLUctgtacagacgcgtgtacgcatgtaacattatactgaaaaccttgcttgagaaggttttgggacgctcgaaggct 
ttaatttgcaagctggagaccaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctgg 

20 cgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacagga 
ctataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtcc 
gccmctcccttcgggaagcgtggcgcmctcaatgctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagc 
tgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaag 
acacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttctt 

25 gaagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaa 
gagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgWgcaagcagcagattacgcgcaga 
aaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttg 
gtcatgcatgagatcagatctttmgtagaaatgtctt 

tgcaactccgaacgacctgctggcaacgtaaaattctccggggtaaaacttaaatgtggagtaatggaaccagaaacgtctc 
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ttcccttctctctccttccaccgcccgttaccgtccctaggaaattttactctgctggagagcttcttctacgg 
aatgctcttcccagcattacgttgcgggtaaaacggaggtcgtgtacccgacctagcagcccagggatggaaaagtcccg 
gccgtcgctggcaataaitagcgggcggacgcatgtcatgagattattggaaaccaccagaatcgaatataaaaggcgaac 
acctttcccaa^ggtttctcctgacccaaaga 
5 cgaggaattcacgtggcccagccggccgtctcggatcggtacc 

SEQ ID NO; 25 pGAPADE 1 glsIIHDEL 

cgtactcacctctttggtagccaaaagtttctccatcatcaacataaagatctccaacggctcgtccctcggtatcaggggctat 
aactattacatatggatcgtttttcattaacattgaagatcttctatacttatctttcatagtgataatgtgaccgcctt 

10 tggaatmatccaatggtgcagaaatattcttttctatcaaatcagtaccattgtttataaaagagtgtaaagatgcgaatt 

aatataccgggtgggaaaaccatttccgtttctgattgaccaggctccgtgacaggtttgactaatagacctgaattactccag 
taaaattggttatcgatatgatacaattcagcaaattcagggtgttcaataaacattggattcattatcggaaatccagtgacact 
tgatttatgaaacatggtgtataaggtaggtagcaggaaatatctcaattggataatatcacgtactatcgacttcaaaggttcat 
tgaataagtatggttctcttctcttggtgtctatatgggcgtgtgctctaaaaaatgggtaccataagcccgcttggtac 

15 gcaatcaattcaggtgtaggatcctcagcaaagccagctatgtcggctcctataaatggcataccagcaatgttgtttgacag 
aaccataggaatggaaatctttaagtaatcccaattggccacattgtcaccagtccatgtggcagcagtacgttgagagccg 
gcaaaaaaagcccttgttagaaggaaaggacgcttatcggatggtgaataaatcgattttattgcgtcgtaagtagcttcatgc 
actgatagaccatatatgttatggacggatctUcctcaatgtaattgtcgtgaatcaaatcttttggagctgtggtctctgggcca 
tcgaaaatcgaaggctcgttcatatcattccaaatgaataaattagttaaatcagccggcagatccataaaccgttcgaaaaag 

20 gacttccaaatcttttggccatatttgcttatggtatcaatccatatagaattacctggccagcaatgacctacatagtcatttccat 
tgtgatccttgactgctacatWcattaattaccctgtcactgatttcataatctttctttaaatgaggatcgattagtacgacaaga 
tttctacccaacttttttaaitttggata^ 

tactccaagtccaaccaaataaaatcgtaaggaatcatatgagcatccatctgagagtccactgtgagaacgtccatctcatc 
attataattccatctacattgatggtaccctatagaggaaatgggcggtaaaaagggtctaccagtcaaatcggtaaatttgt^ 
25 atgatagttggaatatctggccccagggacatgactacatrt^ 

ctggtgtcatactttatgtctacccaagtgtcagctgcattgacccaaaagatagatgtggacgaagatgaaaacatgaatgg 
gatcgaaccgtacattggttggctggtaccgatgttgtactcaaagacatcaacgttgaaaagcctgtagggttcctttccacc 
tgaagtgtccatcagccttagcgacgtcgcatgttccggtataccgtagacattagtagaacccatgaaagagaaatctagc 
gcaaccgattcaggccccaaaggcatagagtcatgctttgaatacaagaaattgtccttaaacatgttgaaagttgtttcttctg 
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10 



15 



20 



25 



gcagcacgtgtgcgaagtmcctgcttagttctatgatgttcaatgttcaggaaattttgctcgtttacaataagtttcagcgcatt 

ttgccagtaaactttcaattgaaaaggttcagcaaagatttctacggatacatcaccgtttcgaagatgaaatgtgtctgcagtg 

gagtttgaaagtgacaaaaatgaagatat^cgaccagaatgagttcacagtttgttmgcttaaggaagtggaattgtgga^^ 

actggtcctgttcgcctcctcttgaaamcttgtcgaatgcgtacttccaggtctcattgaaccgttgtgaagagato 

cgctgctgttggttggcattctctctttctcatttatagtgaaccttactgagtgatcctgtaaaaaagagagagagaatgggaa 

ctgaacggctatatcatcgccctccaatcttggtatagtmaattatggtagcatgaagcacattctctaaaggatcgtgtgca^ 

tagactcggcgtccactttgtaatagcagtgatgagatttggcaatattttctgcataaaccctgtttGtatggcaaaacccagat 

tgcgcacacttctttaatagatagtcggtaaacgcatgcgaaaaagcggtaaagaagaccaattggcatacgagccatttca 

aaaggaccatctcgaggtaccgatccgagacggccggctgggccacgtgaattcctcgtttcgaaatagttgttcaattgatt 

gaaatagggacaaataaattaaatttaaagtctttgggtcaggagaaaccaaaattgggaaaggtgttcgccttttatattcgat 

tctggtggtttccaataatctcatgacatgcgtccgcccgctattattgccagcgacggccgggacttttccatccctgggctg 

ctaggtcgggtacacgacctccgttttacccgcaacgtaatgctgggaagagcattgctgcaagggggccgtagaagaag 

ctctccagcagagtaaaatttcctagggacggtaacgggcggtggaaggagagagaagggaagagacgtttctggttcca 

ttactccacatttaagttttaccccggagaattttacgttgccagcaggtcgttcggagttgcaacggagccagatatttcagag 

atggctacctgattggacgaggacaccaagacatttctacaaaaaagatctgatctcatcgaccggctgcattaatgaatcgg 

ccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctcactgactcgctgcgcteggtcgttcgg 

ctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacat 

gtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctg 

acgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccct 

ggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtggc 

gctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgt' 

tcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagca 

gccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctac 

actagaaggacagtamggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaa 

acaaacbaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctt 

tgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatctt 

cacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagtaaacttggtctgacagttaccaatgctta 

atcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactccccgtcgtgtagataactacgatac 

gggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctcaccggctccagatttatcagcaataa 
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accagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctccatccagtctattaattgttgccggg 
aagctagagtaagtagttcgccagttaatagmgcgcaacgttgttgccattgctacaggcatcgtggtgtcacgctcgtcgtt 
tggtatggcttcattcagctccggttcccaacgate 
cttcggtcctccgatcgttgtcagaagtaa^ 
5 tcatgccatccgtaagatgcttttctg^ 

gctcttgcGcggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgctcatcattggaaaacgttcttcg 
gggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatgtaacccactcgtgcacccaactgatcttcagcatct 
tttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaaagggaataagggcgacacggaa 
atgttgaatactcatactcttccttWcaatagctccaaggcaacaaattgactactcagaccgacattcattcgttattgatm 

10 atcaacgataaacggaatggttacttgaatgatttcactttatgatcattgtttactaattacctaaataggattttatatggaattgg 
aagaataagggaaatttcagatgtctgaaaaaggcgaggagggtactaatcattcaagcccatttcttgccagtaattgcttca 
taagcttcaatatacttttctttactcttgatagcaatttctgcatccatggctacgccctctttgccattcaatccgttggccgtcaa 
ccaatctctgagaaactgcttatcgtaactctcttgcgamacccacttggtaagtcttttgattccaaaatctagaagaatctgg 
agttaaaacttcatctactagtaccaattcattgttttcgtccagtccaaattcgaatttcgtatcagcaataatgatccccttcaaa 

15 , agggcgaagtttmgcagcagaatacaactcgaccgccttgacagcgaccttctcacaaatgtcmacctacaatctcag 
gcttgttcaatagagatgtMcatcgtgttcaccctgttcagctttcgttgaaggtgtgaaaatcggagttggaaaggcgtcg 
ctcttgaaggttctcgttttcaaccttgactccatggacagttmgagttcttgtactcmceatgcacttccagtgatgtaac 
tgacaatggcttccaaaggtatcagtctgtgctmttactatcaaggatcgtccctctaattgagatttgtattmc^ 
tttgatggtagtaaagcaaagacttccttgtcattagaagcaaccaaatgattctttatgtagggtgccaaaaaatcaaaccag 

20 aaaactgagagctgagtcaaaatctttcccttatcaggaataccgtttgtcataateacatcgtaagcggagatacggtcagtt 
gcgacgaacagcaagttgttctcatcgactgcataaatgtctctaacctttcctttggcgattaaaggtaggattccgtccagat 
cagtgttcacaatggacatacttggaaggatacagcaaagtgtgttggaagcgatgacacatggaaaggaatttttcgagttt 
cctagagtagtatattggggcggtgaaagttcagatgtttaatgcttaatactcttatactcttcaaagcgcccaagtgtttctgc 
caacctgactttmctgaataatgaatcgttcaagtggagtatttaaaccatgattaag^acgtgattt 

25 gaaaaatatccgtattcataaacgattattggtaaaagttacaaaataccactaattacggagaagcttagtaacagttatcatct 

cttggtcgattaacgcttacaatttccattcgccattcaggctgcgcaactgttgggaagggcgatcggtgcgggcctcttcg 

ctattacgccagggcctcgaggcacaaacgaacgtctcacttaatcttctgtactctgaagaggagtgggaaataccaagaa 

aaacatcaaactcgaatgattttcccaaacccctaccacaagatattcatcagctgcgagataggctgatcaggagcaagct 

cgtacgagaagaaacaaaatgacaaaaaaaatcctatactatataggttacaaataaaaaagtatcaaaaatgaagcctgca 
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tctctcaggcaaatggcattctgacatcctcttgattagaatctagcaagaccggtcttctcgtaagtgcccaacttgaactgag 

gaacagtcatgtctaaggctaaaactcacagttcgtcgtggtcgacggcgctattcagatcctcttctgagatgagtttttgttc 

gggcccaaaaataacttcccaatcttcagttatgtcaagcgataggttaagaatggtgatcttattatcattctcatatgacgatc 
tagtcggcaatgaactttcttcgccatttttmgactttaatactgte 

5 atttmcaatattggtattcctcaaagtattgtggtgaatgcctgtcaaattctcgggaatatgacttcgaacatttmaag^ 
ttttcgaaaatgaactgagtttcta 

SEQIDNO:26 

GACGAGATCTTTTTTTCAGACCATATGACCGG 

10 

SEQ ID NO: 27 

GCGGAATTCTTTTCTCAGTTGATTTGTTTGT 
SEQ ID NO: 28 

15 GCG GGT CGA CCA CGA CGA ACT GTG AGT TTT AGC CTT AGA CAT GAC 
SEQ D3 NO: 29 

CAG GAG CAAA GCT CGT ACG AG 

20 
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We claim: 

1. A vector capable of expressing an cc-l,2-mannosidase or a 
functional part thereof in a methylotrophic yeast strain, comprising a nucleotide 

5 sequence coding for said <x~l,2-mannosidase or said functional part. 

2. The vector of claim 1, wherein said a-l,2-mannosidase is a protein 
from a fungal species. 



10 3 - The vector of claim 2, wherein said fungus is Trichoderma 



reesei. 



4. The vector of claim 1 , wherein said cc-1 ,2-mannosidase is a protein 
from a mammalian species. 

15 5 - The vector of claim 4, wherein said a-l,2-mannosidase is murine cc- 

1 ,2-mannosidase IA or IB. 

6. The vector of claim 1, wherein said a-l,2-mannosidase or said 
functional part is tagged with an ER-retention signal. 

20 . 

7. The vector of claim 6, wherein said ER-retention signal comprises 
peptide HDEL. 

8. The vector of claim 1 3 wherein the nucleotide sequence coding for 
25 said a- 1 ,2-mannosidase or said functional part is operably linked to a promoter and a 

3' termination sequence. 
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9. The vector of claim 8, wherein said promoter is the promoter of a 
gene selected from the group consisting of AOXI, AOXH, GAP, and FLD. 

10. A vector selected from the group consisting of 

5 pGAPZMFManHDEL, pGAPZMFManMycHDEL, pPICZBMFManMycHDEL, 
pGAPZmManHDEL, pGAPZmMycManHDEL, pPIC9mMycManHDEL and 
pGAPZmMycManHDEL. 

1 1 . A vector capable of expressing a glucosidase II or a functional part 
1 0 thereof in a methylotrophic yeast strain, comprising a nucleotide sequence coding for 

said glucosidase II or said functional part 

12; The vector of claim 1 1, wherein said glucosidase II is a protein 
from a fungal species. 
15 .... 

13. The vector of claim 12, wherein said fungus is Saccharomyces 

cerevisiae. 

> ■ 

14. The vector of claim 1 1, wherein said glucosidase II is a protein 
20 from a mammalian species. 

15. The vector of claim 1 1 , wherein said glucosidase II or said 
functional part is tagged with an ER-retention signal. 

25 .16. The vector of claim 1 5, wherein said ER~reterition signal comprises 

peptide HDEL. 
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17. The vector of claim 1 1 , wherein the nucleotide sequence coding for 
said a-l,2-mannosidase or said functional part is operably linked to a promoter and a 
3' termination sequence. ; 

5 1 8 - The vector of claim 1 7, wherein said promoter is the promoter of a 

gene selected from the group consisting of AOXI, AOXII, GAP, and FLD. 

1 9. A vector having the designation pGAPZAGLSII, pPICZAGLSII, 
pAOX2ZAGLSII, pYPTIZAGLSH, pGAPADEglsII, pPICADEglsII, 

10 pAOX2ADEglsTI, pYPTIADEglsII, pGAPZAglsIIHDEL and pGAPADEglsIIHDEL. 

r 

20. A vector for disrupting the Ochl gene in a methylotrophic yeast 
strain, comprising a portion of the Ochl gene and a selectable marker gene, wherein 
said portion of the Ochl gene and said selectable marker gene are linked in such a way 

15 to effect the disruption of the genomic Ochl gene in said methylotrophic yeast strain. 

21. A vector having the designation pBLURAS'PpOCHl. 

22. A method of reducing the glycosylation on proteins produced from 
20 a methylotrophic yeast, comprising transforming said yeast with any one of the vectors 

of claims 1-21. 

23. The method of claim 22, wherein said yeast is Pichia pastoris. 

25 24. The method of claim 23, wherein said yeast is a Pichia pastoris 

strain selected from GS1 15 (NRRL Y-15851), GS190 (NRRL Y-18014), PPF1 
(NRRL Y-18017), PPY12-OH, yGC4, or derivatives thereof. 
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25. The method of claim 22, 23 or 24, wherein said yeast has been 
genetically engineered to expresses a heterologous protein. 

26. A genetically engineered strain of a methylotrophic yeast, wherein 
5 said strain is transformed with at least one of the vectors of claims 1-21. 

27. A method of reducing the glycosylation of a heterologous 
glycoprotein expressed from a methylotrophic yeast, comprising transforming cells of 
said methylotrophic yeast with at least one of the vectors of claims 1-21, and 

10 producing said glycoprotein from the transformed cells. 

28. A method of producing a glycoprotein with reduced glycosylation 
in a methylotrophic yeast, comprising transforming cells of said methylotrophic yeast 
with at least one of the vectors of claims 1-21 and with a nucleotide sequence capable 

15 of expressing said glycoprotein in said yeast, and producing said glycoprotein from the 
transformed cells. 

29. A glycoprotein produced by the method of claim 27 or 28. 

20 30. The glycoprotein of claim 29, wherein said glycoprotein has a 

reduced immunogenicity as relative to the glycoprotein produced from a wild type 
strain of said methylotrophic yeast 

31. The glycoprotein of claim 29, wherein said glycoprotein is suitable 
25 for use in human therapeutics. 

32. A kit comprising any of the vectors of claims 1-21 . 
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33. The kit of claim 32, further comprising a methylotrophic yeast 

. strain. 

34. A kit comprising the methylotrophic yeast strain of claim 26. 

5 
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Glucosidase II assay on heterologously expressed Pichia protein 
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